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1. WHAT IS ARITHMETIC GEOMETRY? 


Algebraic geometry studies the set of solutions of a multivariable polynomial equation 


(or a system of such equations), usually over R or C. For instance, 2? + ry — 5y? = 1 
defines a hyperbola. It uses both commutative algebra (the theory of commutative rings) 
and geometric intuition. 

Arithmetic geometry is the same except that one is interested instead in the solutions 
where the coordinates lie in other fields that are usually far from being algebraically closed. 


Fields of special interest are Q (the field of rational numbers) and F,, (the finite field of p 


elements), and their finite extensions. Also of interest are solutions with coordinates in Z 
(the ring of integers). 


Example 1.1. The circle x? + y? = 1 has infinitely many rational points, such as (3/5, 4/5). 
Finding them all is essentially the same as finding all Pythagorean triples. 


Example 1.2. The circle x? + y? = 3 has no rational points at all! 


Example 1.3. The curve x‘ + y* = 1 has exactly four rational points, namely (+1,0) and 
(0,41). This is the exponent 4 case of Fermat’s Last Theorem: this case was proved by 
Fermat himself. 


We'll develop methods for explaining things like this. 


2. ABSOLUTE VALUES ON FIELDS 


One approach to constructing the field Q,, of p-adic numbers is to copy the construction 


of R, but with a twist: the usual absolute value is replaced by an exotic measure of size. 


Definition 2.1. An absolute value on a field & is a function 


k-> R>o 


xr || 


such that the following hold for x,y € k: 
(Abs1)  ||z|| = 0 if and only if « = 0 
(Abs2) ||zy|| = ||x|| - lly 


(Abs3) |x + yll < [lll + |lyl] (“triangle inequality”) 


Examples: 
e R with the usual | | 
e C with the usual | | (or any subfield of this) 
e any field k with 


1, toe U 
I|z]| = 
0, if eG. 


This is called the trivial absolute value. 


Definition 2.2. An absolute value || || satisfying 
(Abs3’) ||z + y|| < max(|l2'||, ||y|]) | (“nonarchimedean triangle inequality” ) 


is said to be nonarchimedean. Otherwise it is said to be archimedean. 


(Abs3’) is more restrictive than (Abs3), since max(||x||, ||y||) < |||] + |]yl|- 

(Abs3’) is strange from the point of view of classical analysis: it says that if you add many 
copies of a “small” number, you will never get a “large” number, no matter how many copies 
you use. This is what gives p-adic analysis its strange flavor. 

Of the absolute values considered so far, only the trivial absolute value is nonarchimedean. 
But we will construct others soon. In fact, most absolute values are nonarchimedean! 


3. THE p-ADIC ABSOLUTE VALUE ON Q 


The fundamental theorem of arithmetic (for integers) implies that every nonzero rational 
number « can be factored as 


awe [[e” =n a a 
P 
where u € {1,—1}, and n, € Z for each prime p, and n, = 0 for almost all p (so that all but 
finitely many factors in the product are 1, making it a finite product). 


Definition 3.1. Fix a prime p. The p-adic valuation is the function 
Up: Q* + Z 
CS elt) Sts, 


that gives the exponent of p in the factorization of a nonzero rational number x. If « = 0, 
then by convention, v,(0) := +oo. Sometimes the function is called ord, instead of vy. 


Another way of saying the definition: If x is a nonzero rational number, it can be written in 
the form p"—, where r and s are integers not divisible by p, and n € Z, and then v,(x) := n. 
S 


Example 3.2. We have v2(5/24) = —3, since 5/24 = 2732 = 27937151. 
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Properties: 
(Val1) v,(x) = +00 if and only if « =0 
(Val2) v,(zy) = vp(x) + vp(y) 
(Val3) vp(e + y) = min(vp(2), vp(y)) 
These hold even when x or y is 0, as long as one uses reasonable conventions for +00, 
namely: 
e (+00) +a = +00 
e+ora 
e min(+00o,a) =a 
for any a, including a = +oo. 
Property (Val2) says that if we disregard the input 0, then v, is a homomorphism from 
the multiplicative group Q* to the additive group Z. 


Proof of (Val3). The cases where x = 0 or y = 0 or x + y = 0 are easy, so assume that 2, y, 


and «+ y are all nonzero. Write 
nl m 
=p" (and) y=pP 


with r,s,u,v not divisible by p, so u,(x) = n and v,(y) = m. Without loss of generality, 
assume that n <m. Then 


Here sv is not divisible by p, but N might be so N might contribute some extra factors of 


p. Thus all we can say is that 


vp(w + y) > n = min(n,m) = min(v,(z), »,(y)). 


Definition 3.3. Fix a prime p. The p-adic absolute value of a rational number « is defined 
by 
Z|p = pee. 


If c = 0 (i-e., vp(x) = +00), then we interpret this as |0|, := 0. 


Properties (Vall), (Val2), (Val3) for uv, are equivalent to properties (Abs1), (Abs2), (Abs3’) 
for | |p. In particular, | |, really is an absolute value on Q. 


4. OSTROWSKI’S CLASSIFICATION OF ABSOLUTE VALUES ON Q 


On Q we now have absolute values | |2, | |3, | |5, ..., and the usual absolute value | |, 
which is also denoted | |,., for reasons having to do with an analogy with function fields that 


we will not discuss now. Ostrowski’s theorem says that these are essentially all of them. 
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Definition 4.1. Two absolute values || || and || ||’ on a field & are said to be equivalent if 
there is a positive real number a such that 


Ilz|l’ = |lx|1° 
for all xz € k. 


Theorem 4.2 (Ostrowski). Every nontrivial absolute value on Q is equivalent to | |p for 
some p< Hw. 


Proof. Let || || be the absolute value. 

Case 1: there exists a positive integer b with |\b|| > 1. Let b be the smallest such positive 
integer. Since ||1|| = 1, it must be that 6 > 1. Let a@ be the positive real number such that 
||b|| = 6°. Any other positive integer n can be written in base b: 


n=aj+a;b+---+a,b° 
where 0 < a; < 6 for all i, and a, # 0. Then 
IIl] < llaol] + |lax5|| + llazb?|] +--+ + |lasb*|| 
= |[ao|| + |]ar||6* + |Jaa||b°* +--+ + |]as|]b°* 
<14+5%4+ 0°74 ---4+5% (by definition of b, since 0 < a; < b) 
=(14+0%+b %4---+8-%) b 
<Cn* (since b° <n), 
where C’ is the value of the convergent infinite geometric series 
1+b°%+b 4+... 
This holds for all n, so for any N > 1 we can substitute n™ in place of n to obtain 
IIn™ |] <C(n™)2, 
which implies 
InP < Cn)” 
In|] < CV ne. 
This holds for all N > 1, and C/N > 1 as N > oo, so we obtain 
|r|] < n* 


for each n > 1. 


We next prove the opposite inequality ||n|| > nm for all positive integers n. Given n, 
choose an integer s such that b° <n < b’t!. Then 


O°" || < [lal] + [0° = nl 


sO 


[Jr] = [br] — []be** = nl] 


= pista _ |p2+l _ pl (since ||b|| = b%) 


> p(stt a (ae —_ 1 a (by the previous paragraph) 


> pst a (ot ie aaa (since bs <i ne ae 


-em f(A) 
“tor (4) 


=n: 


where c is a positive real number independent of n. This inequality, ||n|| > cn® holds for all 
positive integers n, so as before, we may substitute n = n%, take N‘ roots, and take the 
limit as N — oo to deduce 
I|n|] = n®. 
Combining the previous two paragraphs yields ||n|| = n° for any positive integer n. If m 
is another positive integer, then 


[|r|] - [reff] = [Im] 


||m/nl| = |]m]/I[n|] = m*/n® = (m/n)*. 


Thus ||q|| = ¢° for every positive rational number. Finally, if q is a positive rational number, 
then 

l| — all = | — 1 - llall = @* =|-al® 
so ||x|| = |x|* holds for all x € Q (including 0). 

Case 2: ||b|| = 1 for all positive integers b. Then as in the previous paragraph, the axioms 
of absolute values imply that ||z|| = 1 for all x € Q*, contradicting the assumption that || || 
is a nontrivial absolute value. 

Case 3: ||n|| <1 for all positive integers n, and there exists a positive integer b such that 
\|b|| < 1. Assume that b is the smallest such integer. If it were possible to write b = rs for 
some smaller positive integers r and s, then ||r| = 1 and ||s|| = 1 by definition of b, but then 
\|b|| = ||r|| - ||s|] = 1, a contradiction; thus 6 is a prime p. 

We prove (by contradiction) that p is the only prime satisfying ||p|| < 1. Suppose that q 
were another such prime. For any positive integer N, the integers p% and q are relatively 
prime, so there exist integers u,v such that 


up” +uqn =1, 
i 


and then 
1 = ||| = |lup* + vg" | 
< full -lp® + [lol - Nall 
< |p|!” + |lall”. 
This is a contradiction if N is large enough. So ||q|| = 1 for every prime q # p. 


Since 0 < ||p|| < 1 and 0 < |p|, < 1, there exists a positive real number a such that 
Ilp|| = |p|$. Now, for any nonzero rational number 
i I] q’" 
primes q including p 
property (Abs2) (and || — 1|| = 1) imply 
|| || = II llall"* = Ilpl|"* 


primes q including p 


a 


since all the other factors are 1. Since ||p|| = |p|}, this becomes 


I|ar|| = [plp?* = [Ip 


5. CAUCHY SEQUENCES AND COMPLETION 
Let k be a field equipped with an absolute value |] |]. 


Definition 5.1. A sequence (a;) in k& converges if there exists € € k such that for every 
€ > 0, the terms a; are eventually within ¢ of @: i.e., for every € > 0, there exists a positive 
integer N such that for all i > N, the distance bound |la; — ¢|| < € holds. In this case, @ is 
called the limit of the sequence. 


Equivalently (a;) converges to @ if and only if ||a; — ¢|| > 0 as i > oo. The limit is unique 
if it exists: if (a;) converges to both @ and @’, then 


|e’ — el] < lla; — 2|| + lla: — e|| 3 04+0=0, 
so ||’ — €\| =0, so l=. 
Definition 5.2. A sequence (a;) in k is a Cauchy sequence if for every € > 0, the terms are 


eventually within € of each other; i.e., for every € > 0, there exists a positive integer N such 
that for all i,7 > N, the distance bound ||a; — a;|| < € holds. 


Proposition 5.3. If a sequence converges, it is a Cauchy sequence. 


Proof. Use the triangle inequality. 


Unfortunately, the converse can fail. 


Definition 5.4. A field k is complete with respect to |] || if every Cauchy sequence converges. 


We would like every Cauchy sequence to converge, but this might not be the case. To 
fix this, for each Cauchy sequence that does not converge, we could formally create a new 
symbol that represents the limit and treat it as if it were a new number. But some Cauchy 
sequences look as if they should be converging to the same limit, so we need to identify 
some of these symbols. So the new symbols really should correspond to equivalence classes 
of Cauchy sequences that do not converge. Actually there is no harm in creating symbols 
for Cauchy sequences that converge already, as long as these new symbols are identified with 
the pre-existing limits. Finally, we can think of the equivalence classes themselves as being 
the symbols. 


Definition 5.5. Two sequences (a;) and (b;) are equivalent if ||a; — b;|| > 0 as i > oo. 


One can check that this induces an equivalence relation on the set of sequences. Any 
sequence equivalent to a Cauchy sequence is also a Cauchy sequence. 


Definition 5.6. The completion k of k with respect to || || is defined to be the set of 
equivalence classes of Cauchy sequences in k:. 


One can define all the field operations on k. For instance, the product of the equivalence 
classes of the Cauchy sequences (a;) and (b;) is the equivalence class of (a,b;). (One can 
check that this is a Cauchy sequence, and that its equivalence class is unchanged if (a;) and 
(b;) are replaced by equivalent Cauchy sequences.) The 1 in k is the equivalence class of the 


sequence 1,1,1,.... If (x,;) is a Cauchy sequence not equivalent to (0,0,0,...), then the 2; 
are eventually nonzero, and setting y; := defines a Cauchy sequence whose 
0 it 2,=0 


equivalence class is an inverse of the equivalence class of (2;). 

Moreover, the operations satisfy all the field axioms, so k is a new field. The map k > k 
sending a to the equivalence class of the constant sequence (a, a,...) is a ring homomorphism, 
and ring homomorphisms between fields are always injective, so k is identified with a subfield 
of k. 

Define an absolute value || ||’ on & by decreeing that the absolute value of the equivalence 
class of (a;) is lim;_,. ||a;|]. The restriction of ||; ||’ to the embedded copy of k is just the 
original absolute value ||; ||. If a € k is represented by the Cauchy sequence (a;) in k, then 
the sequence (a;) viewed in k converges to a. 

The absolute value || ||/ is nonarchimedean if and only if || || was. (One way to see that 
is by using the characterization that ||; || is nonarchimedean if and only if ||n|| < 1 for all 
positive integers 7.) 

Finally, k is complete with respect to | ||. 


Example 5.7. The completion of Q with respect to the usual absolute value | | is the field 


R of real numbers. 


Proposition 5.8. Let k be a subfield of a complete field L. Then 


(1) The inclusion k + L extends to an embedding k — L. 
(2) If every element of L is a limit of a sequence ink, then the embedding ko L is an 


isomorphism. 


Proof. (1) Given an element a € k, represented as the limit of (a;) with a; € k, map 
a to the limit of (a;) in L. This defines a ring homomorphism k > L, which is 
automatically injective since these are fields. 

(2) Suppose that every element of L is a limit of a sequence in k. Given @ € L, choose 
a sequence (a;) in & converging to @. Then (a;) is Cauchy, so it also converges to an 
element a € k. This a maps to @, by definition of the embedding. So the embedding 


is surjective as well as injective; hence it is an isomorphism. 


6. INVERSE LIMITS 


Definition 6.1. An inverse system of sets is an infinite sequence of sets (A,) with maps 
between them as follows: 


ete A eA pe A, ee Ay 


Definition 6.2. The inverse limit A = lim A, of an inverse system of sets (A,,), (fn) as 
above is the set A whose elements are the infinite sequences (a,) with a, € A, for each 
n > 0 satisfying the compatibility condition f,(@n41) = dn for each n > 0. It comes with a 
projection map €,: A > A, that takes the n* term in the sequence. 


Remark 6.3. If the A, are groups and the f, are group homomorphisms, then the inverse 
limit A has the structure of a group: multiply sequences term-by-term. If the A, are rings 
and the f,, are ring homomorphisms, then the inverse limit A has the structure of a ring. 


7. DEFINING Z, AS AN INVERSE LIMIT 


Fix a prime p. Let A, be the ring Z/p"Z. Let f, be the ring homomorphism sending 
b:=b+ p""1Z to b:=b+p"Z. The ring of p-adic integers is Z, := lim Ap. 
For example, if p = 3, then a sequence like 


0 mod 1, 2 mod 3, 5 mod 9, 23 mod 27, --- 


defines an element of Zs. 
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8. PROPERTIES OF Z, 


Recall that a sequence of group homomorphism is exact if at the group in each position, 
the kernel of the outgoing arrow equals the image of the incoming arrow. For example, 


Ia 446436056 


is called a short exact sequence if f is injective, g is surjective, and g induces an isomorphism 
from B/A (or more precisely, B/f(A)) to C. 


Proposition 8.1. For each m > 0, 
(32,572,270 0 


is exact. (Here the first map is the multiplication-by-p™ map, sending (n)n>o to (p'"Gn)n>0-, 
and €m Maps (An)n>0 to Am.) 


Proof. First let us check that multiplication-by-p on Z, is injective. Suppose that a = 
(an) € Zp is in the kernel. Then pa = 0, so pa, = 0 in Z/p"Z for all n. In particular, 
PAn41 = 0 in Z/p"*!Z. That means that an41 = p"Yn41 for some Yn41 € Z/p"*'Z. But then 
On = fn(GQn41) = DP’ fn(Yn4i) = 0 in Z/p"Z. This holds for all n, so a = 0. 

Exactness on the left: Since multiplication-by-p is injective, composing this with itself m 
times shows that multiplication-by-p™ is injective. 

Exactness on the right: Given an element 3 € Z/p'™Z, choose an integer b that represents 
6. Then the constant sequence 6 represents an element of Z, mapping to /. 

Exactness in the middle: If a € Zp, then ém(pa) = p™e(a) = 0 in Z/p™Z. Thus the image 
of the incoming arrow (multiplication-by-p™) is contained in the kernel of the outgoing arrow 
(Ey). 

Conversely, suppose that x = (z,,) is in the kernel of €,,. So %, = 0. Then for all n > m, 
we have np € ae So there is a unique y,_m Mapping to x, via the isomorphism 


Z mm 
ae 
pr, pL, 


These ym are compatible (because the x, are), so as n ranges through integers > m, they 


form an element y € Z, such that py = x. So z is in the image of multiplication-by-p”. 


Proposition 8.2. 
(1) An element of Z, is a unit if and only if it is not divisible by p. In other words, the 
group of p-adic units ZF equals Zp — pZy. 
(2) Every nonzero a € Zy can be uniquely expressed as p™u with n € Zso and u € Z. 


Proof. 
11 


(1) Ifa = (a,) € Z,y is divisible by p, then a, = 0, so a cannot have an inverse. Conversely, 
if a = (ap) is not divisible by p, then a, € Z/p"Z is represented by an integer not 
divisible by p, so a, has an inverse b,, € Z/p"Z. These b,, must be compatible, and 


b := (b,) is an inverse of a in Zp. 


— 
Nw 
Na 


Existence: If a = (an) € Z, is nonzero, then there is a largest n such that a, = 0. For 
that n, Proposition [8.1] implies that a = pu for some u € Z,. Moreover, u cannot 
be divisible by p (since otherwise a,,,; = 0 too), so u is a unit. 

Uniqueness: Suppose that p’u = p™u’. If m = n, then using injectivity of 
multiplication-by-p™ we get u = u’, so the factorizations are the same. Otherwise, 
without loss of generality n > m. Then u’ = p"~u is a unit divisible by p, contra- 


dicting (ip. 


Multiplying nonzero elements p"u and p™u’ yields p"*™ uu’, whose (n-+-m+1)* component 
is nonzero, so Z, is an integral domain. In fact, Z, is a UFD with one prime! 


9. THE FIELD OF p-ADIC NUMBERS 


Definition 9.1. The field Q, of p-adic numbers is the fraction field of Zp. 


Each nonzero a € Q, is uniquely expressible as p"u with n € Z and u € ZF. (For 
existence, any nonzero a € Q, is (pu’)/(p™u) for some m,m! € Zso and u,u! € Zx, SO 
a=pr-™(wu), 

Define the p-adic valuation on Q, by v,(p"u) = n whenever n € Z and u € ZF, and 
Up(0) := +00. Then define |z|, := p~’) for each x € Q,. 

The ring Z injects into Z,, so its fraction field Q injects into Q,, and the p-adic valuation 
and absolute value on Q,, restrict to the p-adic valuation and absolute value on Q previously 
defined. 


Proposition 9.2. 


(1) The field Q, is complete with respect to | |p. 
(2) Every element of Q, is a limit of a sequence in Q. 


Proof. 


(1) Let (a,) be a Cauchy sequence in Q,. Then (a,) is bounded. By multiplying by 
a suitable power of p, we can reduce to the case where a, € Z, for all n. Choose 
an infinite subsequence S, whose image in Z/pZ is constant. Choose an infinite 
subsequence 35 of S, whose image in Z/p?Z is constant, and so on. Form a sequence 
by choosing one element from 5S), a later element from Sj, and so on. Then this 


subsequence converges in Z, to the element whose image in each Z/p"Z is the image 
12 


of the subsequence S;,. Finally, a Cauchy sequence with a convergent subsequence 
converges. 

(2) Let a € Q,. By multiplying by a suitable power of p, we reduce to the case where 
a€ Z,. Write a = (a,) with a, € Z/p"Z. Choose an integer 6, € Z representing ay. 
Then v,(a — bn) > n, so |a — b,| < p~”, so the sequence (b,,) converges to a in Q,. 


Combining Propositions [5.8] and [9.2|shows that Q, is the completion of Q with respect to 


| |p. 


10. p-ADIC EXPANSIONS 


Definition 10.1. Say that a series }>*°, a, of p-adic numbers converges if and only if the 
sequence of partial sums converges with respect to | |p. 


Theorem 10.2. 
(1) Eacha € Z, has a unique expansion a = bo +bip+bop*+--- with b, € {0,1,...,p—1} 
for all n. 
(2) Each a € Q has a unique expansion a = Yi ,e7 bnp” in which by € {0,1,...,p— 1} 
and b, =0 for all sufficiently negative n. 
(3) For either expansion, v,(a) is the least integer n such that b, 4 0. (If no such n 
exists, then a = 0 and v,(a) = +00.) 


Proof. 


(1) Existence: Write a = (a,) with a, € Z/p"Z. Choose s, € {0,1,...,p” — 1} repre- 
senting dp. Write s, = bo + bip +--+ + bpp”! with b; € {0,1,...,p —1}. The 
compatibility condition on the a, implies that the b; so defined are independent of 
n; i.e., the base-p expansion of s,,,,; extends the base-p expansion of s, by one term 
bpp”. Then sn — a in Q,, so 


bo + bip + bop? + --- =a. 
Uniqueness: If b/, € {0,1,...,p — 1} also satisfy 
by + Dip + bop? +--+ =a, 
then we get 
by + bip +--+ + bpp’? = bo + Uipt--- +0, 1p" (mod p"), 
but both sides are integers in {0,1,...,p" — 1}, so they are equal, and this forces 


b; = 0; for all 2. 
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(2) Existence: If a € Q,, then there exists m € Z such that pa € Z,. Write 
pa = by + bip + bop? + --- 


with b; € {0,1,...,p— 1} and divide by p™. 
Uniqueness: Follows from uniqueness for Zp. 
(3) Ifa=bo +hip+--- € Z, with b; € {0,1,...,p—1}, and bp # 0, then a has nonzero 
image in Z/pZ, so a is a unit, and v,(a) = 0. The general case follows from this one 
by multiplying by p” for an arbitrary n € Z. 


11. SOLUTIONS TO POLYNOMIAL EQUATIONS 


Lemma 11.1 (“Compactness argument”). Let --- > Sy > S; + So be an inverse system 
of finite nonempty sets. Then him 5; is nonempty. 


Proof. Let T;9 be the image of S; + --- + So. Then 


se G Too S Tio © Ton, 


but these are finite nonempty sets, so T;,9 must be constant for sufficiently large 7. Let Ep be 
this “eventual image”. Define JT; and £; in the same way, and define Ey, and so on. Then 
the E; form an inverse system in which the maps Ej; — EF; are surjective. Choose eg € Ep, 
choose a preimage e; € FE; of e€9, choose a preimage eg € EF» of e;, and so on: this defines an 


element of lim Sj. 


Proposition 11.2. Let f € Z,|x] be a polynomial. Then the following are equivalent: 


(1) The equation f(x) =0 has a solution in Zp. 
(2) The equation f(x) =0 has a solution in Z/p"Z for every n > 0. 


Proof. Let S, be the set of solutions in Z/p"Z. Then lim S;, C lim Z/p"Z = Z, is the 
set of solutions in Zp. We have lim Sp, # () if and only if all the S, are nonempty, by 


Lemma {11.1 


12. HENSEL’S LEMMA 


Hensel’s lemma says that approximate zeros of polynomials can be improved to exact 


ZeYros. 


Theorem 12.1 (Hensel’s lemma). Let f € Z,|x]. Suppose that f(a) = 0 (mod p), and 
f'(a) #0 (mod p). (That is, a is a simple root of (f mod p).) Then there exists a unique 
be Z, with b=a (mod p) such that f(b) =0. 
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Proof. We prove by induction that for n > 1 there exists a, € Z, such that a, =a (mod p) 
and f(a,) = 0 (mod p”) (and that a, mod p” is uniquely determined). For n = 1, take 
a, = a. Now suppose that the result is known for some n > 1. So 


f (an) = p"c, 
for some c € Z,. We try to adjust a, slightly to make the value of f even smaller p-adically. 
More precisely, we try dn41 = G,+€ for a p-adic integer € to be determined: 'Taylor’s theorem 
gives 
f(Qn4i) = f(@n) + f'(an)e + glee? 
for some polynomial g(x) € Z,[x]. (This is really just expanding f(a, +) as a polynomial 
in €.) Choose € = p"z with z € Z,. Then 
f (Gn+1) = F (Gn) + f’(an)p"2 + g(p"z)p*"2" 
= p"c+ f'(an)p"2z (mod p”**). 


Since 


we get 
f(an4i) = (c+ f'(@z)p"_ (mod p"**), 
and there is a unique z mod p that makes c+ f’(a)z =0 (mod p), and hence a unique choice 
of Gn41 mod p"*! that makes f(ay4,) = 0 (mod p"*!) This completes the inductive step. 
Since f(z) = 0 has a unique solution in each Z/p"Z congruent to a modulo p, these 


solutions give a unique solution in Z, congruent to a modulo p. 


This is the p-adic analogue of Newton’s method, in which one approximates the poly- 
nomial by a linear function in order to pass from an approximate zero to an even better 


approximation to a zero. 


13. STRUCTURE OF QF 
The map €,: Z, + Z/p"Z restricts to a surjective homomorphism 
Ly —> (Z/p"Z)”. 
Its kernel is U, := 1+ p"Zpy. So ZF /U, = (Z/p"Z)™*, and 


Z% ~ lim ZX /Uy ~ Yim(Z/p"Z)*. 


The U,, form a descending chain of subgroups inside Z;: 


Pearle U3 © Uy, CU, © Zip - 


Let F, := Z/pZ. (Generally one writes F, when F, is being thought of as a field, and 


Z/pZ when it is being thought of as a ring or an abelian group.) 
15 


Lemma 13.1. The quotients in the filtration are: 


(1) ZS /U, = FY, and 


p? 


(2) Un/Unyi ~ Z/pZ for alin > 1. 


Proof. The first of these has already been proved. For the second, observe that 


U, > Z/pZ 
1+p"z + (z mod p) 


is surjective and has kernel U,,,,. 


Corollary 13.2. The order of U,/U,, is p"™. 


Proposition 13.3. Let [1p_1 be the set of solutions to x?~' = 1 in Zi. Then fp-1 18 a group 


(under multiplication) mapping isomorphically to FX, and ZY = Uy X pp-1. 


Proof. The set fip—1 is the kernel of the (p — 1) power map from Z, to itself, so it is a 
group. Given a € {1,2,...,p—1}, Hensel’s lemma shows that j1,_; contains a unique p-adic 
integer congruent to a modulo p. And there are no elements of j4,_; congruent to 0 mod p. 


So reduction modulo p induces an isomorphism ffp_1 — FY. 


We have U; M fly_1 = {1} (by Hensel’s lemma, there is only one solution to z?~! — 1 = 0 
congruent to 1 modulo p). Also, U; + Mp1 = ZF, since any a € Z* can be divided by an 
element of 1p; congruent to a modulo p to land in U;. Thus the direct product Uy X pp-1 


is equal to Z. 


Lemma 13.4. Let p be a prime. If p #2, letn > 1; ifp = 2, letn > 2. Ifa € Un — Un41, 
then x” = Unis — Unsa- 


Proof. We have x = 1+ kp” for some k not divisible by p. Then 
mene (;) es ()) i a 


=1+4+ kp" ‘Gnod:p"). 


so x? € Uni — Unto. 


Proposition 13.5. [fp #2, then U, ~ Z,. If p= 2, then Uy = {£1} x U2 and U2 ~ Zp. 


Proof. Suppose then p # 2. Let a = 1+ p € U, — Ug. By the previous lemma applied 

repeatedly, QP € Uis1 — Ujsg. Let a, be the image of a in U,/U,. Then ae # 1 but 

ap" = 1, so a, has exact order p"~!. On the other hand, the group it belongs to, U;/Un, 

also has order p"~!. So U;/U,, is cyclic, generated by a,. We have an isomorphism of inverse 
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systems 
+ Z/p"Z — Z|p"1Z — --. 


| 


= 0 a ee 

Taking inverse limits shows that Z, ~ U4. 
For p = 2, the same argument with a = 1 +4 works to prove that Z2 ~ U2. Now {+1} 
and Uy, have trivial intersection, and they generate U, (since Uz has index 2 in U;), so the 
direct product {+1} x U2 equals Uj. 


Theorem 13.6. 
(1) The group Z* is isomorphic to Z/(p—1)Z x Z, ifp # 2, and to Z/2Z x Zy if p = 2. 
(2) The group Q* is isomorphic to Z x Z/(p—1)Z x Z, if p #2, and to Z x Z/2Z x Zy 


yp=2. 
Proof. 
(1) Combine Propositions and 
(2) The map 


Zx Zi + Q 
(n,u) > p™u 


is an isomorphism of groups. Now substitute the known structure of Z> into this. 


14. SQUARES IN Q> 
14.1. The case of odd p. 


Theorem 14.1. 
(1) An element p™u € QF (withn € Z andu € ZY) is a square if and only ifn is even 


and u mod p is a square in Fy. 


(2) We have Q* /Q%? ~ (Z/2Z)?. 
(3) For any c € Z¥ with cmod p ¢ FX? the images of p and c generate Q /Q. 


D2 


Proof. 
(1) We have Q* = p* x FX x Z,, and 2Z, = Zp, so 


P 


x2 27, 1x2 r 
Q =p" xF>° x Z. 


Thus an element p”u is a square if and only if n is even and u mod p € es 
(2) Using the same decomposition, Q* /Q*? = (Z/2Z) x (Fx /F*?) x {0} ~ (Z/2Z)* since 
Fy is cyclic of even order. 
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(3) Under the isomorphism above, p and c correspond to the generators of the two copies 
of Z/2Z. 


14.2. The case p = 2. 


Theorem 14.2. 


(1) An element 2"u € QS (withn € Z and u € Z}) is a square if and only if n is even 
and u = 1 (mod 8). 

(2) We have Q¥ /Q3? ~ (Z/2Z). 

(3) The images of 2, —1, 5 generate QX /Q>?. 

Proof. 

(1) We have Q¥ = 2% x {+1} x Uz, where Up ~ Zp. Under this last isomorphism, U3 

corresponds to 2Z., so 
Ono I Us, 

Thus an element 2”u is a square if and only if n is even and u=1 (mod 8). 

(2) Using the same decomposition, 


QX /Qz? = (Z/2Z) x {+1} x Ze /2Z_ ~ (Z/2Z)°. 


(3) Under the isomorphism above, 2, —1, and 5 correspond to the generators of the three 
copies of Z/2Z. 


15. p-ADIC ANALYTIC FUNCTIONS 


A power series f(z) := })a,z”" with a, € Q, defines a differentiable function on the open 
set in Q, on which it converges. 

Identities between complex power series with rational coefficients can be used to deduce 
identities between p-adic power series. For example, consider the formal power series 


oe 
zy eS ee ee ae 
2 3 
ge. 
z+5+o+ 


in Q|[z]]. Over C, they define the analytic functions log(1 + z) and e* — 1, respectively, in 
some neighborhoods of 0. They are inverses to each other. So their composition in either 
order is a function represented by z € Q|[z]]. On the other hand, their composition in either 
order is represented also by the formal composition of the power series in Q||z]]. Two formal 
power series representing the same analytic function are the same, so the formal power series 


are inverse to each other. Finally, this identity saying that the composition of the formal 
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power series in either order gives z implies that the corresponding p-adic analytic functions 
are inverses of each other when both converge. 


16. ALGEBRAIC CLOSURE 
Given a field k, let k[z]> be the set of polynomials in kx] of degree at least 1. 


Definition 16.1. A field k is algebraically closed if and only if every f € k[z]>1 has a zero 


in k. 


Definition 16.2. An algebraic closure of a field k is a algebraic field extension k of k that is 
algebraically closed. 


Example 16.3. The field C of complex numbers is an algebraic closure of R. But C is not 


an algebraic closure of Q because some elements of C (like e and 7) are not algebraic over 


Q. 
Theorem 16.4. Every field k has an algebraic closure, and any two algebraic closures of k 
are isomorphic over k (but the isomorphism is not necessarily unique). 


Step 1: Given f € k[z]s1, there exists a field extension E D k in which f has a zero. 


Proof. Choose an irreducible factor g of f. Define E := k[z|/(g(x)). Then FE is a field 
extension of k, and the image of x in F is a zero of f. 


Step 2: Given f1,...,fn € k[z]s1, there exists a field extension E D k in which each f; 
has a zero. 


Proof. Step 1 and induction. 


Step 3: There exists a field extension k’ > k containing a zero of every f € k[x]s1. 
Proof. Define a commutative ring 
kX sp: f € kle]ai}] 
(f(Xy) : f € k[x]>1) 
Suppose that A is the zero ring. Then 1 is in the ideal generated by the f(X;). So we 


A:= 


have an equation 
l= nsi(Xsf,) eal GnFn(X fn): 
for some polynomials g;. By Step 2, there exists a field extension F' > k containing a zero 
a, of each f;. Evaluating the previous equation at Xs, = a; yields 1 = 0 in F’, contradicting 
the fact that F is a field. 
Thus A is not the zero ring. So A has a maximal ideal m. Let k’ := A/m. Then k’ is a 


field extension of k, and the image of X;, in k’ is a zero of Fj. 


Step 4: There exists an algebraically closed field ED k. 
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Proof. Iterate Step 3 to obtain a chain of fields 


RCM Cec EM Cass 


Let E be their union. Any polynomial in E[z];, has coefficients in some fixed k™, and 


hence has a zero in k*, so it has a zero in E. Thus FE is algebraically closed. 
Step 5: There exists an algebraic closure k of k. 


Proof. Let E be as in Step 4. Let k be the set of a € E that are algebraic over k. Since 
algebraic elements are closed under addition, multiplication, etc., the set k is a subfield of 
E. And of course, k is algebraic over k. 

If f € k[a]s1, let 8 be a zero of f in E; then is algebraic over the field k(coefficients of f), 
which is algebraic over k, so ( is algebraic over k, so 3 € k. Thus k is algebraically closed. 


Step 6: If F is an algebraic extension of k, and L is an algebraically closed field then any 
embedding k ~ L extends to an embedding E' — L. 


Proof. If E is generated by one element a, then E ~ k/z]/(f(x)) for some f € k[x]>1. Choose 
a zero a’ € L of f, and define FE — L by mapping a to a’. 
If E is generated by finitely many elements, extend the embedding in stages, adjoining 


one element at a time. 


In general, use transfinite induction (Zorn’s lemma). 


Step 7: Any two algebraic closures of k are isomorphic over k. 


Proof. Let E and L be two algebraic closures of k. Step 6 extends k @— L to E @& L. If 
E # L, then the minimal polynomial of an element of L — E would be a polynomial in 


E|x|s1, contradicting the assumption that E is algebraically closed. 


17. FINITE FIELDS 


Let F, be Z/pZ viewed as a field. 


Theorem 17.1. For each prime p, choose an algebraic closure F, of Fp. 
Ly 


(1) Given a prime power q = p", there exists a unique subfield of F,, of order q, namely 
Pg es Feet aah 

(2) Every finite field is isomorphic to exactly one Fy. 
(3) Fym CF» if and only if m|n. 

(4) Gal(Fan/F,) ~ Z/nZ, and it is generated by 


Frobg: Fan > Fy 
re xt, 


Proof. 
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(1) The p* power map 


Frob, : Ee _ F, 


rr xP. 


is a field homomorphism, by the binomial theorem. In particular, it is injective. Since 


7 


F,, is algebraically closed, Frob, is also surjective. So Frob, is an automorphism of 


F,. If g =p”, then the gq power map Frob, is Frob;,, so it too is an automorphism 


of F,. Then F, is the subset of F, fixed by Frob,, so F, is a field. Since x? — x and 


(x4 — x) = —1 have no common zeros, the polynomial x? — x has q distinct zeros 


in F,. Thus #F, = q. This proves the existence half of (1). 


(2) (and uniqueness in (1)) Conversely, if K is any finite field, then the characteristic of 
kK is a prime p > 0, and the image of Z — K is a subfield isomorphic to F,,. Viewing 


kK as an F,-vector space shows that #K = p” for some n > 1. Let q = p”. The 


embedding F, — F,, extends to an embedding K < F,. Since K™ is a group of order 
q — 1, every element of K™ satisfies x7~' = 1, so every element of K satisfies x4 = x, 


sok CF,. But #K = #F, = 49, s0 K =F,. Finally, kK cannot be isomorphic to 


any Fy with q' 4 q, because its size is q. 


(3) If Fym C Fyn, then F,n is a vector space over F,m, so p” is a power of p™ (namely, p™ 


raised to the dimension), so m|n. 


Conversely, if m|n, write n = rm; then 


Fom = {fixed points of Frobym } 
C {fixed points of (Frobpm)"} 
= {fixed points of Frob,rm } 


(4) The order of Froby € Aut(F,») is the smallest m such that 7” = x for all x € Fyn, 
which is n. In general, if G is a finite subgroup of Aut(A’), then K is Galois over the 
fixed field K© and Gal(K/K°) =G. Apply this to K = Fy» and G the cyclic group 
of order n generated by Frob, € Aut(F,): the fixed field is F,, so we get 


Gal(Fyn/F,) = G ~ Z/nZ. 


The primitive element theorem says that every finite separable extension of a field k 
is generated by one element a, i.e., is of the form k|a|/(f(x)) for some monic irreducible 


polynomial f(x) € k[x] (the minimal polynomial of a). So we get 
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Corollary 17.2. Given a prime power q and n > 1, there exists a monic irreducible poly- 


nomial f(x) € F,[x] of degree n. 


Remark 17.3. It is not known whether one can find such a polynomial in deterministic 
polynomial time! This is unsolved even for g prime and n = 2: i.e., the problem of finding 


a nonsquare in F,, in time polynomial in log p is unsolved. 


On the other hand, if one repeatedly chooses a random monic polynomial over F, of 


degree n, then there is a fast test for irreducibility, and one can estimate the probability of 
irreducibility to show that this succeeds in random polynomial time. 


Example 17.4. F,[t|/(t® +t + 1) is a finite field of order 8. 


Warnings: Fs # Z/8Z (the latter is not even a field), and F, ¢ Fs. 


Proposition 17.5. [fk is a field, and G is a finite subgroup of k*, then G is cyclic. 


Proof. As an abstract group, 


Z, Z, 
Gag uee 
a,Z An 


for some positive integers a; satisfying a, > 1 and a,|a;,; for all 7. Ifn > 1, then G has more 


than a; elements of order dividing a,. But x*! — 1 can have at most deg(a™! — 1) = a; zeros 


in k. Thus n = 1, so G is cyclic. 


Remark 17.6. There is an alternative proof that avoids the structure theorem for finite 
abelian groups, and instead uses a more elementary counting argument to prove that if G is 
a finite group of order n such that for each d\n, the group G has at most d elements satisfying 
x? = 1, then G is cyclic. 


Corollary 17.7. The group Fj is cyclic of order q — 1. 


18. INVERSE LIMITS IN GENERAL 


Earlier we defined the inverse limit him S; of a sequence of sets S; indexed by the natural 
numbers equipped with maps $;,, — S;. Now we will define lim S; given a collection of sets 
(S;)ier for more general index sets, equipped with maps. 


Definition 18.1. A partially ordered set (poset) is a set J equipped with a binary relation < 
such that for all x,y,z € J, 

(PO1) «x <x (reflexivity) 

(PO2) Ifx<yandy <a, then x = y (antisymmetry) 

(PO3) Ifz<yand y < z, then x = z (transitivity). 


Definition 18.2. A directed poset is a nonempty poset J such that 
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(PO4) For every x,y € J, there exists z with x > z and y > z (any finite subset has an 
upper bound). 


Example 18.3. The set Zs with the usual ordering is a directed poset. 


Example 18.4. The set Zso with the ordering in which m < n means m|n is a directed 
poset. 


Definition 18.5. An inverse system of sets is a collection of sets (S;)ie7 indexed by a directed 
poset J, together with maps o! : S; — S; for each 7 < 7 satisfying the following for all 
i,j,k ET: 
(IS1) ¢': S; > S; is the identity 
ok J 
(IS2) For any i <j <k, the composition S, —>+ S; BEN S; equals oF. 
Definition 18.6. The inverse limit of an inverse system of sets (S;);er with maps (¢/);<; is 
lim Si := {9 € [| s: : @) (a;) = a, for all i <i] ; 
wel iel 
equipped with the projection map 7; to each S}. 


So to give an element of him P 5; is to give an element of each S; such that the elements 
are compatible with respect to the maps in the inverse system. 


Example 18.7. If J is Zso with the usual ordering, then this definition of inverse limit re- 


duces to the earlier one. It might seem that there are more maps o! , but they are determined 
as compositions of the maps ¢/?. 
If the S; are groups and the ¢! are group homomorphisms, then lim. 7 S; is a group. 


Example 18.8. Let J be Zxo ordered by divisibility. For each n € I, let G, = Z/nZ. For 
n|N, define 


oN: Z/NZ > Z/nZ 
at> @. 
Then the inverse limit lim Z/nZ is a group called Z. 


Example 18.9. Let G be any group. Let J be the collection of normal subgroups N < G 
of finite index. Define N < N’ if N’ C N. This makes J into a directed poset. (Given 
N,, No € I, the intersection N, M No is an “upper bound” for Ni and No.) If N’ C N, we 
have a surjective homomorphism 
G/N' > G/N 
gq. 
The inverse limit G := lim, G/N is a group called the profinite completion of G. 
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Example 18.10. The profinite completion of Z is Z. 


Theorem 18.11. Let S be the inverse limit of an inverse system of sets (S;)icr with maps 
(¢})i<j;- Then S has the following universal property: 


(1) There are maps 7: S > S; fori € I, compatible with respect to the ob); i; 


5 
me 


FG. 


& 


commutes for alli < j. 

(2) For any other set T equipped with maps g;: T > S; fori € I, compatible with respect 
to the ¢!, there exists a unique map a: T —> S such that g;(t) = m;(a(t)) for alli € I 
andt eT: 


Example 18.12. Let G be the profinite completion of a group G. Then G has a natural 
quotient map to G/N for each finite-index normal subgroup N < G. These maps are com- 
patible with the maps G/N’ —+ G/N of the inverse system, so the universal property yields 
a homomorphism G —> G, 


Dag 


Proposition 18.13. There is an isomorphism Z — Plsdaus p Lp 


Proof. Fix a prime p. Let I be Zyo ordered by divisibility. For n € I, let Gy := Z/p”™Z. 
For m|n, let G, > Gm be the quotient map sending 1 to 1. To give a compatible collection 
of elements of the G,, is equivalent to giving a compatible collection of elements of Z/p™Z 
for m > 0, so lim Gy Dig: 
For each n € J, the Chinese remainder theorem gives a natural isomorphism 
Z Z 
a@a~ UU jawe 
primes p 


Taking the inverse limit of both sides yields 


Z~ | | Ly. 
primes p 
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19. PROFINITE GROUPS 


Definition 19.1. A profinite group is an inverse limit lim, G; of finite groups Gj. 


Examples: 


(1) Zy = = lim Z/p"Z for any prime p 

(2) 2 = lim Z/nZ 

(3) GLp(Ze) = lim, GL, (Z/p"Z) for any fixed prime p and fixed r > 0. 
(4) The ee completion of any group. 


19.1. Order. 


Definition 19.2. Assuming that the inverse system maps G; — G; are all surjective, the 
order #G of a profinite group G := lim ji G; is the least common multiple of #G;, interpreted 
as a Supernatural number IL, p® where each e, is either a nonnegative integer or oo. 


Example 19.3. #Z*% = 275°. 


19.2. Topology on a profinite group. (This subsection is for those who know the basic 
definitions of topology.) The profinite topology on a profinite group G = lim, ep Ci ; is con- 
structed as follows. Equip each finite group G; with the discrete topology. Equip [],<,G 
with the product topology. Then G = him _ G; is a closed subset of [[,-; Gi, and we give " 
the subspace topology. By Tychonoff’s theorem, [];.,; Gi is compact, so its closed subset G 
is compact too. 


19.3. Subgroups. The profinite group G is equipped with group homomorphisms 7;: G —> 
G;. If H; is a subgroup of G;, then 7; '(H;) is a subgroup of G. These are called the open 
subgroups of G. 

If for every 7 we choose a subgroup H; of G; such that each o! : G; + G; maps H; into 
HAf;, then lim _ Hf is a subgroup of G = him _, G;. These are called the closed subgroups of 
G. 

The open subgroups are exactly the closed subgroups of finite index. In particular, every 
open subgroup is a closed subgroup, but not vice versa in general. 


Example 19.4. The profinite topology on Z, := lim Z/ p"Z agrees with the topology coming 
from | |,. The open subgroups of Z, are the subgroups p°Z, for e = 0,1,2,.... The closed 
subgroups are these together with the trivial subgroup {0}. 


Subgroups of a profinite group that are not even closed are generally worthless! When one 


encounters such a subgroup, one takes its closure right away. 
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20. REVIEW OF FIELD THEORY 


We recall some definitions of field theory. Let L/k be an algebraic field extension. 


Definition 20.1. The extension L/k is normal if it satisfies one of the following equivalent 


conditions: 


(1) Every irreducible polynomial in k[x] with a zero in L factors completely into linear 
factors in L{z]. 

(2) If we embed L in an algebraic closure of k, so k C L Ck, then every o € Aut(k/k) 
satisfies o(L) = L. 


Definition 20.2. A polynomial f(x) € k[z] is separable if it satisfies one of the following 


equivalent conditions: 


(1) When factored in k[z] for an algebraic closure k of k, it has no repeated factors. 
(2) The polynomial f(x) and its derivative f’(x) have no common zeros in k. 
(3) We have gcd(f(x), f’(x)) = 1 in k[z]. 


We will usually be applying the notion of separable to minimal polynomials, which are 
irreducible. Over a field & of characteristic 0, every irreducible polynomial is separable. 
Proof: We have deg f’(x) < deg f(x), and chark = 0 implies f’(x) 4 0, so f’(x) is not 
divisible by f(x). so gcd(f(x), f’(x)) = 1. 

Thus separability is an issue mainly in the case of characteristic p > 0. 


Definition 20.3. An element a in L is separable over k if it satisfies one of the following 
equivalent conditions: 
(1) It is a zero of a separable polynomial in k{z]. 
(2) The minimal polynomial of a over k is separable. 
(3) Either char k = 0, or char k = p and the minimal polynomial of a over k is not of the 
form g(x”) for a polynomial g(x) € k[z]. 


The set of elements of L that are separable over k form an intermediate subfield. 
Definition 20.4. If every element of L is separable over k, then L is called separable over k. 


By the remark preceding the definition, it is enough if L is generated by separable elements. 
If k is a field of characteristic p, the image of the p-power Frobenius endomorphism k — k 
is a subfield k? := {a?: a€k} of k. 


Definition 20.5. A field k is perfect if it satisfies one of the following equivalent conditions: 
e Either char k = 0, or chark = p and k = k?. 
e Every finite extension of k is separable over k. 


e Every algebraic extension of k is separable over k. 
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Example 20.6. Finite fields are perfect. 


Example 20.7. The prototypical example of an imperfect field is k = F,(t). The prototyp- 


ical example of an inseparable extension is the extension L = k(t'/?) of this k. The minimal 
polynomial of t!/? over k is 2? — t, which is irreducible (as minimal polynomials always are), 
but not separable. 


Definition 20.8. Call L/k Galois if it is both normal and separable. In this case, the Galois 
group Gal(L/k) is the set of automorphisms o of L such that o(x) = x for all x Ek. 


Definition 20.9. If blah is a property of a group (e.g., abelian), call L/k blah if L/k isa 
Galois extension and Gal(L/k) is blah. 


Definition 20.10. Let k be a field. Choose an algebraic closure k. The separable closure of 
k (in a fixed algebraic closure k) is k8°P := {a € k : a is separable over k}. It is the maximal 
subfield of k that is separable over k. 


The extension k*?/k is Galois. 


Definition 20.11. The absolute Galois group of k is G, := Gal(k*°?/k). 


21. INFINITE GALOIS THEORY 


Let AK/k be a Galois extension (possibly of infinite degree). Let J be the set of fields F’ 
such that k C FC K and F is a finite Galois extension of k. Order I by inclusion. 


Proposition 21.1. 
(1) If FF’ € I, then their compositum FF" (the subfield of K generated by F and F") 
is in I too. 
(2) I is a directed poset 
(3) Ifk CE CK and E is finite over k, then E C F for some F € I. 
(4) Ure P= K. 
Proof. 
(1) This is a well-known fact about Galois extensions. 
(2) This follows from (1). 
(3) The primitive element theorem expresses as Eas k|x]/(f(x)). Let F' be the splitting 
field of f(z). 
(4) This follows from (3). 


For each F' € J, the group Gal(F’/k) is finite. If F C F”, then we have 
oe: Gal(F'/k) —» Gal(F/k) 


or O|p. 
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Proposition 21.2. For any Galois extension K/k, there is an isomorphism 
Gal(K/k) > lim Gal(£°/k:) 
Fel 


o++ (o|F)rer- 


Proof. Each F is normal over k, so o|7 maps F' to F’. The ol are compatible. So by the 
universal property of the inverse limit, we have a well-defined homomorphism 


Gal(K/k) + lim Gal(F/k). 


Conversely, compatible elements of Gal(F'/k) for all F' € J, glue to give a unique automor- 
phism in Gal(K/k). 


Corollary 21.3. Any Galois group Gal(K/k) can be viewed as a profinite group. In this 
setting, the profinite topology is also called the Krull topology. 


Theorem 21.4 (Main theorem of Galois theory). Let K/k be a Galois extension. Let 
G = Gal(K/k). Then there exists an inclusion-reversing bijection 
{fields E such that k C E C Kk} © {closed subgroups of G'} 
E & Gal(K/E) 
K" oH. 
Moreover, if E & H, then 


E/k is normal = > 4H is normal in G 
(and then Gal(E/K) ~ G/H) 
E/k is finite <= > H is open in G. 
21.1. Examples of Galois groups. If k = C, then G; = Gal(C/C) = {1}. 


If k = R, then G, = Gal(C/R) ~ Z/2Z, generated by complex conjugation. 
Ifk =F,, then 


G, = Gal(F,/F,) ~limZ/nZ=Z~ |] Z,, 


prime p 


a pro-cyclic group (an inverse limit of finite cyclic groups). It is topologically generated by 
Frobg; i.e., Gx is the closure of the infinite cyclic subgroup generated by Frob,. 

If k = Q,, then it turns out that G, is a pro-solvable group (an inverse limit of finite 
solvable groups), whose structure is known exactly but is rather complicated. Also, for each 
n > 1, there are only finitely many degree-n extensions of Q,, in Q,- 

If k = Q, then G; is incredibly complicated. Conjecturally every finite group is a quotient 


of it; i.e., every finite group is Gal(F’/Q) for some finite Galois extension F’ of Q. 
28 


Let Q*” be the subfield of Q generated by all finite abelian extensions F/Q. Then 
Gal(Q?”/Q) is abelian; in fact, it is the largest abelian quotient of Gg (where we allow 
quotients only by closed subgroups). 

Let ¢, be a primitive n** root of 1 in Q. “Irreducibility of the cyclotomic polynomial” 
implies that Gal(Q(¢,)/Q) ~ (Z/nZ)”. 


Theorem 21.5 (Kronecker-Weber). Q* = U,,s; Q(4n). 


For instance, Q(/7) is an abelian extension of Q, so the Kronecker-Weber theorem implies 
that /7 must be an element of Q(¢,,) for some n. (In fact, the smallest such n is 28.) 


Corollary 21.6. 
Det dh 

ab ~~ 7 ~~ x ~~ x 

Gal(Q*”/Q) ~ jm (<5) Te eS | | Das 


P 


Class field theory generalizes this to describe the maximal abelian extension k®? of any 
number field k. 
22. AFFINE VARIETIES 
From now on, k is a perfect field, and k is a fixed algebraic closure. Let G, := Gal(k/k). 
22.1. Affine space. 
Definition 22.1. Fix n € Zs. For each field extension L of k, define 
A? (L) := L” 

Here A? is called n-dimensional affine space over k. (If k is understood, we just write A”.) 
Think of k[21,...,2,] as being the ring of functions on Aj. This relationship is written 
Ay = Spec k|ay,: -.; 2m: 

Remark 22.2. The group Gy acts on A"(k), and the set of fixed points A"(k)% is A”"(k). 


22.2. Affine varieties. Loosely speaking, an affine variety is the set of common zeros of a 
set of polynomials. 
Given a subset T of k[x1,...,2n], define Z = Zr by the rule 


Z(L) :={PeL": f(P) =0 for all f € T}. 


Any such Z is called an affine variety over k. (Some authors also require an “irreducibility” 
condition. ) 


Definition 22.3. An element of Z(L) is called an L-rational point on Z, or simply an L-point. 
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Example 22.4. Take k = R, n = 2, and T = {x7 + y? — 1}. Then Z(R) is the unit circle 
in R?. We say “Z is the variety defined by x? + y? = 1 over R”. 


The set of polynomials in k[x1,...,%p| that vanish at a point P is closed under addition and 
closed under multiplication by an arbitrary polynomial. So if J is the ideal of klay,..., 2p] 
generated by TJ, then Z; = Zr. 


Example 22.5. The zero set of x? + y* — 1 and the zero set. of (x? + y? — 1)? in L? for any 
field extension L of k are equal. 


More generally, any ideal J defines the same set of zeros as its radical 
VI :={f €klxi,...,¢n] : f™ € I for some m > O}. 
So we will assume that J is radical (J = VT) from now on. 
Theorem 22.6 (version of Hilbert Nullstellensatz). There is an inclusion-reversing bijection 
{radical ideals of k{a1,...,2,]} < {affine varieties Z in AZ} 
I+ Z, (where Z;(L) = {common zeros of f € I} 
{f € klxy,...,¢n]: f(P) =0 for all P € Z(L) for all L} GZ. 


We can view elements of k[x1,...,%,] as functions on Z, but the functions in J are iden- 
tically 0 on Z, so the ring of functions on Z is actually k[a1,...,2n]/I. Thus we write 
EGii cay lly 
Z = Spec Kimisi sain : 7 J 


[x1 ytd Ln] 
I 


The commutative ring is called the affine coordinate ring of Z. 


Example 22.7. Let X = Spec arn and let Y = Spec nee Ix = y? 


No! One reason: X(C) is nonempty, but Y(C) is empty. Another reason: the ideal 


(x? + y? + 1) is not the unit ideal (1), since x? + y? + 1 has no inverse in R[z, yJ. 


Moral: When k is not algebraically closed, it is important to consider Z(L) for all finite 
extensions L of k instead of just viewing of Z as the set of zeros with coordinates in k. 


Remark 22.8. If Z is any affine k-variety, then Z(k) = Z(k)°%. 


22.3. Irreducible varieties. The variety defined by xy = 0 is the union of the two varieties 
defined by x = 0 and y = 0 in A?. 


Definition 22.9. An irreducible variety is a nonempty variety that cannot be decomposed 


as a union of two smaller varieties. 


One can show that a general variety Z is a finite union of irreducible subvarieties, none 
contained in any other: these are called the irreducible components of Z. 


One can show: 
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Proposition 22.10. Suppose that Z = Speck|a,...,2n|/I, where I is radical. Then the 
following are equivalent: 


e Z is irreducible. 
e [ is a prime ideal. 
e kix,...,%n]/I is an integral domain. 


If Z is irreducible, the function field «(Z) of Z is defined as the fraction field Frac k[x1,...,@n]/J. 


Example 22.11. The function field of Aj is the rational function field 


Ki Wisex2 gg) = 2 -f,g9€ Keys. stal . 


22.4. Dimension. There are a couple of equivalent ways to define dimension of a variety 
Xx. 


Definition 22.12. The dimension dim X of X is the largest integer d such that there exists 
a chain of (closed) irreducible varieties 


Zo GA Gore Cay 
contained in X. (If X = @, then dim X = —oo.) 
An alternative, equivalent definition: 


Definition 22.13. Let X be an irreducible variety. Then dim X is the smallest integer d 
such that the function field «(X) contains elements f;,..., fq such that «(X) is algebraic 
over the subfield k(fi,..., fn) generated by & and the f; inside K(X). 

Then, for any variety X, define dim X as the maximum of the dimensions of its irreducible 


components. 
(Proving the equivalence requires a lot of commutative algebra.) 


Example 22.14. We have dim A” = n. A maximal chain of irreducible subvarieties is 


PGs Oe ae Gres as Che 


corresponding to the chain of prime ideals 


(Piet aeg ly) (Po, cag 8a) tee De) (), 
of k[x1,..., 2p]. (It takes some work to show that there is no longer chain.) 
Alternatively, the function field k(21,...,2,) is algebraic over the subfield generated by 
n elements 21,...,%p. (It takes some work to show that one cannot do it with less than n 
elements. ) 
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22.5. Smooth varieties. 


Definition 22.15. A hypersurface in A? is a subvariety defined by a single equation f(71,...,%) = 
0 with f a nonzero polynomial in k[x1,..., vp]. 


Definition 22.16. Let X be a hypersurface f(a,...,%,) = 0 in Ay. A point P € X(L) 
(for some field extension L of k is a singularity of X if of (Pp) = 0 for all 7. 


The set of singularities forms a subvariety of X, defined by f = 0 together with the 
equations oh =O ford = 10.5%: 


Definition 22.17. A hypersurface X in Aj? is called smooth (of dimension n — 1) or non- 


singular if there are no singularities in X(L) for any L D k (actually it suffices to check 


Example 22.18. Let X be the curve y? = 2° + 1 in Ag. Is X singular? Let f(2,y) := 
y? — x — 1. The singular locus is defined by the equations 


y= = 1=0 
—32? = 0 
2y = 0, 


which have no common solutions in Q, so the curve is smooth. 


(But it would not have been so if instead of Q we were working over the field Fy, or F3.) 


Example 22.19. Let Y be the “nodal cubic” y? = x? + 2?. The singular locus is defined by 
the equations 


y?—23— 2? =0 
—3r? — 27 =0 
2y = 0, 


which have the common solution (0,0). So Y is singular, with a unique singularity at (0,0). 
Near (0,0), the curve Y looks approximately like y? = x? (obtained by discarding higher 
order terms like x?) so it has two “branches” crossing at (0,0). Such a singularity is called 
a node. 


More generally: 


Definition 22.20. A variety X := Spec vin - a is smooth (of dimension n—m) if and only if 


at every point P € X(L) for every extension L of k, the Jacobian matrix (24) € Mnxn(L) 


has rank m. (Again it suffices to check P € X(k).) 
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The condition that one of the m x m minors be nonvanishing is exactly the condition in 
the implicit function theorem to guarantee that X is the graph of a differentiable function, 


if we were working over R or C. 


Remark 22.21. One can show that if X is smooth of dimension r, then dim X = r. 


23. PROJECTIVE VARIETIES 


23.1. Motivation. Let O = (0,0,0) € R’®. 
There is a bijection 


the plane z = 1 in R® © {nonhorizontal lines in R® through O} 


POP 
EN{z=1} aL. 


Think of points in the plane z = 1 as being the corresponding lines. Extend the plane 
by introducing new “honorary points” that represent the horizontal lines. They should be 
thought of as being “points at infinity”: for example, as x — oo, the point (x,0,1) tends 
to infinity in a certain direction, and the corresponding line flattens out and approaches the 


U-axis. 


This yields the projective plane P?(R) whose points correspond to arbitrary lines in R? 
through O. 


23.2. Projective space. Let k be any field. Fix n € Zso. Let L be a field extension of k. 
Define an equivalence relation ~ on L"*! — {0} such that 


(do,---;@n) ~ (Diys25 Un) 
if and only if there exists \ € L* such that b; = Aa;. Define 


P"(L) := rntt — {0} 


Here P? is called n-dimensional projective space over k. 
For ao,.--,@n € L not all 0, let (ap : ... : @,) denote the equivalence class of (ao,..., Qn). 
The a; are called homogeneous coordinates of the point. 


The group G;, acts on P"(k), and the fixed subset is P"(k). (This will be assigned for 
homework. ) 


23.3. Projective varieties. It does not make sense to evaluate a polynomial f € k[xo,...,@n| 
at a point in P”(L), because the polynomial has different values at the different represen- 
tatives of the equivalence class. But if f is homogeneous of some degree d, meaning that in 
every monomial of f the exponents of the variables sum to d, then the condition that f be 0 
at a particular point in P”(L) makes sense, since multiplying the homogeneous coordinates 


by multiples the value of f by A7. 
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Given a set T of homogeneous polynomials in k[x71,...,@n]|, define Z = Zr by the rule 
Z(L) :={PeP"(L): f(P) =0 for all f € T}. 


Any such Z is called an projective variety over k. 
An ideal J generated by a set 7 of homogeneous polynomials is called a homogeneous ideal, 
and then Z; := Zr satisfies 


Z1(L) ={P © P"(L): f(P) =0 for all homogeneous f € I}. 


Conversely, given a projective variety Z in P”, its homogeneous ideal J is the ideal generated 
by the set of homogeneous polynomials f such that f(P) = 0 for all P € Z(L) for all L. 


Theorem 23.1. There is an inclusion-reversing bijection 


{radical homogeneous ideals of k[xo,...,@,] not (%o,...,2n)} + {projective varieties Z in PZ} 
Tw Zr 


homogeneous ideal of Z <4 Z. 


If 1 + Z, then 
(1) = Soe tol 
is called the homogeneous coordinate ring and one writes 
Z = Proj id ie 7 = | 


Definition 23.2. A projective variety is irreducible if it satisfies any of the following equiv- 


alent conditions: 


e It cannot be written as a union of two smaller projective varieties. 
e Its homogeneous ideal is a prime ideal in k[xo,..., @p]. 


e Its homogeneous coordinate ring is an integral domain. 
23.4. Projective varieties as a union of affine varieties. 
23.4.1. The standard covering of projective space. There are inclusions 
A? oP? 
(2, y) > (ws y:1) 
and 
Pi cy 
(x: y) +> (x: y:0) 
These copies of A? and P! in P? are complements of each other. (If a point (x : y: z) € P?(L) 


has z #0, the homogeneous coordinates can be scaled in a unique way to get a point of the 


form (x: y: 1).) 
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More generally, inside P”, if 2 € {0,1,...,n}, then the hyperplane H; defined by x; = 0 is 
a copy of P”~', and its complement U;, which consists of points of the form (x : +++: i-1: 
1: 2j41:°++: Zp), is a copy of A”. 

Since every point on P” has at least one nonzero coordinate, Ui, U; = P”. 


23.4.2. Homogenization and dehomogenization of polynomials. Given a polynomial f(x,y) € 
k|x, y], we can make a homogeneous polynomial by multiplying each monomial by a suitable 
power of z. For example, 5x2? + 3y? + xy + 7 becomes 5x?z + 3y? + ryz + 7z3. The process 
can be reversed by setting z = 1. 

In general: 


Definition 23.3. Fixi € {0,1,...,n}. Given f € k[xo,..., i-1, Liz1,..., Ln] of total degree 


d, its homogenization is 
x Se x 
d 0 i-1 i+1 n 
at (Bn. : Aa taeg ). 


i Xi Xi Xi 


Conversely, given a homogeneous polynomial F'(xo,...,2n), its dehomogenization (with re- 
spect to 2x;) is 


F (20, +++) Ui-1, 1, v4, eT: cae 


23.4.3. Affine patches of a projective variety. Let X be a projective variety in P”. Let 
I C klao,...,2,] be its homogeneous ideal. Fix i € {0,1,...,n}. Let J; be the ideal 
of klxo,...,Ui-1, Vit1,---,;%n] obtained by dehomogenizing all homogeneous f € J. Then 
the i'* affine patch of X is the affine variety X NU; = Spec rs We have 
io X NU) = X. 

One thinks of X as being constructed by glueing the affine patches in a particular way. 


(More general varieties and schemes can be constructed by glueing affine varieties in other 
ways.) 


*n] be an affine 


K[toy-+y@i—1 Vid yeees 
Pg 


23.4.4. Projective closure of an affine variety. Let V = Spec 
variety. SoV C A" = U; C P”. The projective closure V of V in P” is the projective variety 
defined by the homogeneous ideal generated by the homogenizations of the f € J. 

If J is generated by one element, it suffices to homogenize that one element. 


Example 23.4. The projective closure of the affine plane curve y? = 2? + 2x +7 in P? is 
the projective variety defined by y?2z = x? + 2xz? + 72°. 


If one starts with an affine variety V and takes its projective closure, one can recover V 
by taking an affine patch. 

But if one starts with a projective variety X, and takes an affine patch X M U;, and then 
takes the projective closure, one could get a smaller variety: one loses irreducible components 


in the hyperplane Hj. 
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23.4.5. Properties of projective varieties. 


Definition 23.5. The dimension of a projective variety is the maximum of the dimensions 
of its affine patches. 


Definition 23.6. A projective variety is smooth if and only if all its affine patches are. 


In fact, one can check whether a point P on a projective variety is singular by checking 
any affine patch containing P. 


Definition 23.7. The function field of an irreducible projective variety is the function field 
of any of its nonempty affine patches. (One can show that this is independent of the patch 
chosen. ) 


24. MORPHISMS AND RATIONAL MAPS 


Definition 24.1. Let X be an irreducible variety, and let Y be a projective variety in P”. 
A rational map f: X --+ Y is an equivalence class of (n + 1)-tuples 


(oe iuyseee 7, 
such that f; € «(X) for all 7, and the f; are not all identically 0, and such that for any field 
extension L D k and any P € X(L) such that the f;(P) are all defined and not all 0, 
(fo(P) : fiCP): +++: fx(P)) € YC). 
The equivalence relation is: 


(fo: fits++: fn) = Afoi +: Afn) 
for any A € K(X)*. Say that f is defined (or regular at a point P € X(L) if there exists 
A € K(X)* such that 
(fo(P) = fiCP) s+: fa(P) 
is defined (i.e., each f; is defined at P functions the f;(P) are all defined 


Definition 24.2. A rational map X --+ Y that is defined at every P € X(L) (for all L D k) 
is called a morphism. 


Example 24.3. The map 
P! +P? 
(a: y) Hs (a? : ey: y’) 
is a morphism. (Strictly speaking, it should be written as (t?.: t : 1) or (1: ¢! : 7%), 


where ¢ is the rational function x/y on P'.) Its image is the projective curve in P? defined 


by 2] = xp. 
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Example 24.4. Consider the unit circle X: 2? + y? = 1 over a field k of characteristic not 
2. Let X be its projective closure. Identify P' with the projective closure of the y-axis. For 
all points P € X(L) other than (—1,0), the line through (—1,0) and P intersects this P! in 
a point Q € P!(L). This construction defines a rational map 


f:X —FP! 


ery (Hi). 


There is an inverse construction: For most points Q € P!(ZL), the line through (—1,0) and 
Q intersects X in one point P other than (—1,0), and this defines a rational map 


giP ox 


Ce Ce ae 
1+t2 1420 


Where are these rational maps defined? The first map can be rewritten as 
(ci:y:z)H (y:2+2) =(a@-—2z:-y). 


The first right hand side makes sense except at (1: 0: —1), and the second right hand side 
makes sense except at (1: 0:1), so it is defined everywhere. 
The second map can be rewritten as 


(a: y) > (22 —y? : Wry: 2? +’). 


which is defined everywhere since x? — y? = x? + y? = 0 implies x = y = 0. 

The composition of the two rational maps in either order is the identity map, so one 
says that the two varieties X and P' are isomorphic: X ~ P!. In particular, for each field 
extension L D k, the set X(L) can be parametrized. 

Taking L = Q gives essentially the well-known parametrization of Pythagorean triples. 


Remark 24.5. Sometimes it happens that there are rational maps X --+ Y and Y --+ X 
whose composition in either order is the identity except that one or both of the maps is not 
defined everywhere. In this case, X and Y are said to be birational, which is weaker than 
being isomorphic. 


25. QUADRATIC FORMS 
In this section, k is a field of characteristic not 2. 


Definition 25.1. A quadratic form over a field k is a homogeneous polynomial ¢(#1,...,2n) € 
k|x1,...,2%n| of degree 2. 


Example 25.2. Over Q, take q(x, y) = 2x? + 5xry — 6y?. 
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A quadratic form gives rise to a function gq: V > k, where V = k”. Since #k > 2, the 
function determines the quadratic form, so they will be identified from now on. 
More abstractly: 


Definition 25.3. A quadratic form on a finite-dimensional k-vector space V is a function 
q: V — k such that for a choice of basis e€1,...,e, of V, the function q(a1e1 + ++: + Lnen) 
from k" > k is given by a quadratic form in the previous sense. 


Definition 25.4. A bilinear form on a k-vector space V is a function 
B:VxV—-ok 
such that the identities B(v; + v2, w) = B(v1,w)+ B(v2, w), B(Av, w) = AB(u, w), Biv, wi t+ 
We) = Biv, wi) + B(v, w2), and B(v, Aw) = AB(v, w) hold (where A € k and everything else 
is a vector in V). A bilinear form is symmetric if 
Biv, w) = B(w,v) 
for all u,w € V. 


For each V, there is a bijection 


{quadratic forms on V} > {symmetric bilinear forms on V } 


q(x + y) — q(x) + a(y) 


qu Bia, y) = 5 


q(x) := Bia, rz) HB. 


These can also be described in matrix form: q(x) = x'Ax and B(x, y) = x' Ay for a unique 
symmetric matrix A; here xv and y are viewed as column vectors, and x’ denotes the transpose 
(a row vector). 


Definition 25.5. The rank of a quadratic form is the rank of the associated symmetric 


matrix A. 


Definition 25.6. The quadratic form q(x,...,%n) is called nondegenerate if any of the 
following equivalent conditions hold: 

e The associated symmetric matrix A is invertible. 

e For each nonzero x € V, the linear map y +> B(z, y) is nonzero. 

e The rank of g equals n. 


25.1. Equivalence of quadratic forms. 
Definition 25.7. Two quadratic forms q(x1,...,%») and q'(#1,...,@n) are equivalent if they 


differ by an linear change of variable: q'(x) = q(Tx) for some invertible matrix T. 
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Example 25.8. Are x? + y? and 5x? + 5y? equivalent over Q? Answer: Yes, because 
(2¢ + y)? + (a — 2y)? = 5a? + By’. 


How about x? + y? and 3y? + 322? It turns out that this time the answer is no. 
It is not so easy to tell when two quadratic forms are equivalent! 


Proposition 25.9. Every quadratic form q(#1,...,%n) over k is equivalent to a diagonal 
quadratic form 


2 2 2 
Q4X} + AQ%y +--+ + AnX,. 


Proof. We use induction on dim V. The cases dim V < 1 are trivial. 

If q = 0, then just take all a; to be 0. Otherwise choose v with g(v) 4 0. Since x +> B(z, v) 
is a surjective linear map V — k, its kernel vt := {x € V : B(z,v) = 0} is of dimension 1 
less than V. Also, v ¢ vt (since q(v) = B(v,v) £0), soV ~ ku @vt. Ify = yy 4+ yo with 
y € kv and y € v~, then q(y) = q(y1) + a(y2) +2B (yr, yo) = a(y1) + 4(y2). By the inductive 
hypothesis, g|,1 can be diagonalized, and q(a,v) is of the form a,x?7, where a; = q(v). 


Remark 25.10. If q is equivalent to a,x? +---+a,x2, then the rank of q equals the number 
of nonzero qj. 


25.2. Numbers represented by quadratic forms. 


Definition 25.11. Let q be a quadratic form on V, and let a € k. Say that q represents a 
if there exists a nonzero x € V such that q(x) = a. 


The condition that x be nonzero matters only when a = 0. In this case it is important to 
include this in the definition, since otherwise every quadratic form would represent 0! 


Example 25.12. The quadratic form x? — 2y? over Q represents —7 but not 0. 


Proposition 25.13. If a nondegenerate quadratic form q represents 0, then it represents 
every element of k. 


Proof. Choose e € V such that q(e) = 0. Since q is nondegenerate, there exists f € V with 
Bie, f) 4.0, and f must be independent of e. Then q(re + yf) = ary + by? = (ax + by)y 
for some a,b € k with a = 2B(e, f) 4 0. For any c € k, we can solve (ax + by)y = c by 
setting y = 1 and solving a linear equation for x. Thus even gq restricted to the subspace 


(e, f) represents c. 


26. LOCAL-GLOBAL PRINCIPLE FOR QUADRATIC FORMS 


Theorem 26.1 (Hasse-Minkowski). A quadratic form over Q represents 0 if and only if it 


represents 0 over Q, for all p < oo. 
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(Actually, this was proved by Minkowski alone. Hasse generalized the theorem to the case 
of quadratic forms over a finite extension of Q.) 


Remark 26.2. The fields Q, and R and F,((t)) and their finite extensions are called local fields, 
because Laurent series fields like C((t)) are describing the expansion of functions around a 


single point. On the other hand, Q and F,(t) and their finite extensions are called global 
fields. Local fields are completions of global fields. 


Here are two variants of the theorem: 


Theorem 26.3. Given a € Q, a quadratic form over Q represents a if and only if it repre- 
sents a over Q, for all p < co. 


Theorem 26.4. Two quadratic forms over Q are equivalent if and only if they are equivalent 
over Q, for all p < oo. 


Corollary 26.5. Let X be a (smooth projective) plane conic over Q (i.e., the zero locus in 
Po of a quadratic form q(x,y,z) that is irreducible even over Q). Then the following are 
equivalent: 


(i) X has a rational point. 
(ii) X has a Q,-point for all p < oo. 
(iii) X ~ Ph. 
Proof. (i) <=> (ii) is Hasse-Minkowski. 
(iii) ==> (i) is trivial. 
(i) => (iii): If X has a rational point P, projection from P defines an isomorphism (the 


argument is similar to the argument for the unit circle). 


Remark 26.6. If X is a smooth projective plane conic over F, then X has an F,-point, 
by the Chevalley-Warning theorem proved in the homework, so X ~ Pr: In particular 


#X (Fy) =qtl. 


Definition 26.7. A variety X over Q is said to satisfy the local-global principle (also called 
the Hasse principle) if the implication 


X has a Q,-point for allp<oo = > X has a Q-point 
holds. 


So plane conics satisfy the local-global principle. Unfortunately, more complicated varieties 
can violate the local-global principle. It is a major problem of arithmetic geometry to 


determine which families of varieties satisfy the local-global principle. 
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26.1. Proof of the Hasse-Minkowski theorem for quadratic forms in 2 or 3 vari- 
ables. Note: In prove the Hasse-Minkowski theorem, we can assume that the quadratic 
form is in diagonal form, and that the first coefficient is 1 (scaling it by a nonzero constant 
does not affect whether it represents 0). We do only the hard direction, in which we assume 
that q represents 0 over Q, for all p < oo, and hope to prove that gq represents 0 over Q. 
First consider the 2-variable case, so g is x? — ay” for some a € Q. To say that x? — ay? 


represents 0 is to say that a is a square. We may assume a # 0. Since q represents 0 over R, 
we have a > 0. Write 
i I] p’P. 


primes p 


Since q represents 0 over Q,, the valuation n, must be even. Since this holds for all p, this 
means that a is a square in Q, so q represents 0. 
The proof in the 3-variable case will use the following lemma. 


Lemma 26.8. Let a,b € k where chark 4 2. Let N: k(.,/a) + k be the norm map: if a 
is not a square in k, then N(x + ya) = x? — ay”. (If a is a square, N(x) := x.) Then 
the quadratic form x? — ay? — bz? over k represents 0 if and only if b = N(a) for some 


a € k(/a). 


Proof. Case 1: a is a square, saya = c?. Then x? — ay? = (x + cy)(x — cy), which is 
equivalent to xy, which represents everything, so x? — ay? — bz? = 0 has a solution with 
z = 1. On the other side, b = N(b). 

Case 2: a is not a square. If b is a norm, say b = N(x + y/a), then x? — ay? —b- 1? = 0. 
Conversely, if x? — ay? — bz? represents 0, the nontrivial solution to x? — ay? — bz? = 0 must 


have z #0. Dividing by z? shows that b is a norm. 


We may assume that our 3-variable quadratic form q is x? — ay? — bz? where a,b # 0. 
Multiplying y or z by an element of Q* changes qg to an equivalent quadratic form, so we 
are free to multiply a and b by squares. Thus we may assume that a and 0 are integers, and 
in fact, squarefree integers (i.e., not divisible by the square of any prime). 

We use strong induction on m := |a| + |B]. 

Case 1: m <2. There are four possibilities: 


a? ty? + 2 
ety 2 
got? 
gy? — 27. 


We are assuming that q represents 0 over R, so the first is actually not possible. In the other 


three cases, g represents 0 over Q, as desired. 
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Case 2: m > 2. Without loss of generality |b] > |a]. So |b] > 2. Write 
b= Epi > + Dp 


where the p; are distinct primes. Let p be one of the p;. By assumption, there is a nontrivial 
solution to x? — ay” — bz* = 0 over Q,, and we may assume that x,y,z € Z, and that not 
all are in pZ». 

We claim that a is a square mod p. If not, then considering 2? — ay? — bz? = 0 modulo p 
shows that z = y =0 (mod p), but then p? divides x? and ay?, so p?|bz?, so p|z?, so p|z, so 
x,y,z € pZ,, a contradiction. 

Since a is a square mod p; for all i, and since Z/bZ = [| Z/p;Z, we have that a is a square 
mod b. So there exists t € Z such that t? = a (mod b). Adjust t by a multiple of b to assume 
that |t| < |b|/2. So 


f=a= bb 
for some b’ € Z. We have 
=e). [EP (al 2 (bl 
b'| = < 1< |b 
ee yea 


since |b| > 2. 
Now 0b’ is a norm of an element of Q(,/a), and hence is a norm from Q,(./a). Lemma 
implies that b too is a norm from Q,(,/a), so b’ = (bb’)/b is a norm from Q,(,/a). Thus 


o=ny =b2 =p 


represents 0 over each Q,,. But |a| + |b’| < |a] + |b] (and it’s even better if you divide b’ by a 
square to get a squarefree coefficient), so the inductive hypothesis implies that it represents 
0 over Q. Thus 0’ is a norm from Q(./a). If b’ = 0, then a is a square, and we are done; 
otherwise b = (bb’)/b! is a norm from Q(,/a), and Lemma[26.8|implies that x? — ay? —bz? = 0 
represents 0). 


27. RATIONAL POINTS ON CONICS 


Consider a projective plane conic ax? + by? + cz? = 0 in P%. Without loss of generality, 


a, b,c are nonzero integers. 


Proposition 27.1. If a,b,c € Z are all nonzero, and p is a finite prime such that p { 2abc, 
then ax? + by? + cz? = 0 has a nontrivial solution over Q,. 


Proof. By the Chevalley-Warning theorem, there exists a nontrivial solution over F,. Lift 


this solution arbitrarily to get (xo, yo, 20) € Zp satisfying ax? + byé + cz? =0 (mod p), with 
Lo, Yo, Zo not all in pZ,. Without loss of generality, suppose that rp) ¢ pZ,. Then x is an 
approximate zero of the polynomial 


f(x) := ax? + bys + cz € Z, [a] 
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and p{ f’(%o) = 2azo, so Hensel’s lemma gives an exact solution x; € Z, to f(r) = 0 with 


41 =X #0 (mod pZ,). So (x1, yo, 20) is a solution to ax? + by? + cz? = 0. 


Example 27.2. For which p < 00 does 2” + y* = 3z? in PG have a Q,-point? 

According to Proposition it automatically has a Q,-point for all p except possibly 
00, 2,3. It has the R-point (V3: 0: 1). 

If there were a Qs-point, it would have the form (x : y : z) with z,y,z € Z3 not all 


divisible by 3. Considering the equation modulo 3 yields x? + y? = 0 (mod 3), which implies 


x = y = 0 (mod 8) since —1 is not a square in F3. But then 3? divides x? + y? = 327, so 
3|z, contradicting the assumption that not all of x,y, z are divisible by 3. Thus there is no 
Q3-point. 

Similarly, a Qo-point would have the form (x: y: z) with x,y,z € Z» not all divisible by 
2. Then 0 = 274+ 7? — 327 = 2? 4+ y? + 27 (mod 4), but squares in Zz are 0 or 1 mod 4, so 
x? +y? + 27 can be 0 mod 4 only if 2|z, y, z, a contradiction. Thus there is no Qo-point. 


Remark 27.3. Using quadratic reciprocity, one can show that for any smooth conic X over 
Q, the number of p < oo such that X has no Q,-point is finite and even! 


28. SUMS OF THREE SQUARES 


Lemma 28.1. A nonzero rational number a is represented by x? + y? + 27 over Q if and 
only if a> 0 anda is not of the form 4”u with u € 7+ 8Z. 


Proof. Since x? + y? + z? represents 0 over Q, for all odd primes p, it also represents a over 


such Q,. It also represents a over R since a > 0. So the question is whether it represents a 


over Q». 

First consider the range of x? + y? + z? where x € ZX and y,z € Zy. The range of 2? is 
1+82Z5, so the range of x?+ y?+ 2? is a union of cosets of 8Z, and we just try all possibilities 
modulo 8. Namely, y? is 0, 1, or 4 modulo 8, and 2? is similar, so the range of x? + y?+ 2? is 
{1,2,3,5,6}+8Z». A general triple (x,y, z) € (Q2)? — {0} is obtained from one as above by 
multiplying by a power of 2 (and permuting the variables), and this multiplies the output 


by a power of 4. 


Theorem 28.2 (Gauss). A positive integer a is a sum of three integer squares if and only 
if it is not of the form 4"(8n + 7) with m,n € Zyso. 


Idea of proof, following Davenport and Cassels. By Lemma [28.1] it suffices to show that for 

a € Zyo, if x? + y?+ 2? =a has a rational solution, then it has an integer solution. The 

idea is this: Given a rational point P on the sphere x? + y? + z? =a, let Q be the nearest 

point with integer coordinates. If P = Q, we are done. Otherwise PO intersects the sphere 

in another rational point R, and the fact that PQ < \/(1/2)? + (1/2)? + (1/2)? < 1 implies 
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(with some work) that denom(R) < denom(P), where denom(P) denotes the lem of the 
denominators of the coordinates of P. 


The three squares theorem has two nice corollaries. 
Corollary 28.3 (Lagrange). Every a € Zso is a sum of four squares. 


Proof. If a is a sum of three squares, let the fourth square be 07. Otherwise a = 4(8n + 7) 
for some m,n € Zso. Write 8n + 6 as a sum of three squares; then 8n + 7 is a sum of four 


squares, and the same is true of a. 


Corollary 28.4 (Gauss). Every a € Zso is a sum of three triangular numbers (i.e., three 
numbers of the form m(m + 1)/2). 


Proof. The key trick: if x = 2m-+1, then 
z2—-1 m(m+1) 
a ae 
By the three squares theorem, 8a + 3 is a sum of three squares: 


ge + oe +23 = 8a+3. 


Considering this equation modulo 4 shows that 21, 22,x73 are all odd. Write x; = 2m; + 1. 


Then 
mi(m, + 1) " mMo(Mz + 1) 4 m3(m3 + 1) 


2 2 2 


=a. 


29. VALUATIONS ON THE FUNCTION FIELD OF A CURVE 
Definition 29.1. A curve is a 1-dimensional variety. 
Let C be a curve over k. Let «(C) be the function field of C. 


Definition 29.2. Let P € C(k). Suppose that C is smooth at P. The local ring Op of C 
at P is the set of functions f € «(C) that are regular (defined) at P. Let mp := {f € Op: 
f(P) = 0}, which is a maximal ideal of Op. 


Example 29.3. Take C = Aj. Let P be the origin. Then 


—— : p(t), q(t) € k[t] and q(t) is not the zero polynomial} =? ke) 


fe 
Or = {20 ;q(o) 40} 
{x 


KR oS 
Na Se | a | 


= 


q 
PX") . (0) = 0 and q(0) 4 0} 


= 


YS» —@ma 


q 
au : p(0) £ 0 and q(0) zo}. 
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Every f € «(C)* can be uniquely written as t’u where n € Z and u € OF. The map 
up: K(C) > ZU {+00} 
f=H=tMunn 
0 ++ +00 


is a valuation on K(C). We have 


Also, up(t) = 1. 
All of this generalizes to any smooth curve. 
Theorem 29.4. Let C be a smooth curve. Let P € C(k). Then there is a valuation 
up: K(C) > ZU {+00} 


such that 


Definition 29.5. Say that f has a zero of multiplicity m at P if up(f) =m > 0, and a pole 
of multiplicity m at P if vp(f) = —m < 0. 


Definition 29.6. An element t € K(C) such that uvp(t) = 1 is called a uniformizing parameter 
at P. 


If t is a uniformizing parameter at P, then every f € K(C)* can be uniquely written as 
tu, where n € Z and u € OF. Namely, n = vp(f). 
Over a field like R, the implicit function theorem shows that the part of the curve near P 


is the graph of an analytic function of t, so the different values of t near t = 0 parametrize 
the points of C' near P. 


Remark 29.7. Suppose that C is the curve f(x,y) = 0 in A?, and (a,b) € C(k) is a smooth 


point on C, so either 2£ (a, b) £0 or f(a, b) £0 (or both). 


e If 2£ (a, b) # 0 (so the tangent line is not vertical), then x — a is a uniformizing 
parameter. 


e If (a, b) £0, then y — b is a uniformizing parameter. 
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Example 29.8. Let C be the curve y? = x? —x. Let P = (0,0). At P, the rational function 
y is a uniformizing parameter. So vp(y) = 1. What is vp(x)? We have x = y? (>4,), and 
= € OF (it and its inverse are both defined at P), so up(x) = 2. 


29.1. Closed points. If k is not algebraically closed (but still perfect), then we will want 
to define valuations at more than just the k-points. Going back to the example of Aj, 
the valuation at a k-point was measuring the exponent of t — a in the factorization of a 
rational function. But we should also measure the exponent of each other monic irreducible 
polynomial p(t) in k{t]. The zero set of any such p(t) is an irreducible subvariety of Aj, but 
when considered over k it breaks up as a G;-orbit of points in A1(h). 

In general, a closed point of a variety X is a 0-dimensional irreducible subvariety. If 
X = Speck[ti,...,tn]/Z, then closed points of X are in bijection with maximal ideals of 
klt,,...,t,|/I. If k is algebraically closed, then the closed points are the same as elements 
of X(k). For an arbitrary perfect field k, the closed points of X are in bijection with the 
G,-orbits of points in X(h). 

If P is a closed point of a curve C’ over k, one can define Op and mp as before. The residue 
field «(P) := Op/mp turns out to be a finite extension of k, and deg P := [k(P) : k] is called 
the degree of P. If moreover X is a curve C, and C is smooth at P (which is the same as 
saying that C over k is smooth at any of the k-points into which P breaks up), then there 
is also a valuation vp with the same properties as in the case where P € C(k). 

Working with closed points is an alternative to working with L-points for all (finite) 
extensions L of k. 


30. REVIEW 


e Absolute values, archimedean vs. nonarchimedean 
e Valuations 

e Ostrowski’s theorem 

e Cauchy sequences 


Completion 

e Z, as inverse limit 

e Q, = Frac Z,, or Q, as completion of Q 
e Hensel’s lemma 

e Structure of Z> and Q° 

e Squares in QF 

e p-adic power series 


Algebraic closure 


Finite fields, Frobenius automorphism 


Inverse limits 


Profinite groups, open and closed subgroups 
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e Properties of fields and extensions of fields: normal, separable, perfect, Galois 

e Infinite Galois groups as profinite groups, absolute Galois group 

e Infinite Galois theory 

e Affine varieties, affine coordinate ring 

e Projective varieties, homogeneous coordinate ring 

e Irreducibility and function field 

e Dimension 

e Smoothness 

e Homogenization, dehomogenization, projective closure, affine patches 

e Rational maps, morphisms 

e Quadratic forms, bilinear forms 

e Rank, nondegenerate, equivalence 

e Local-global principle for quadratic forms (Hasse-Minkowski theorem); applications 
to rational points on conics 


e Valuations on a curve, local ring, maximal ideal, uniformizing parameter 


31. CURVES AND FUNCTION FIELDS 


Theorem 31.1. /f 6: C --+ X is a rational map from a smooth irreducible curve to a 
projective variety, then is a morphism (1.e., ¢ is actually defined everywhere). 


Proof. It suffices to check that @ is defined at each closed point P. Suppose that X C P” 


and that @ is given by (fo: ---: fn). Let f be the f; such that uvp(f;) is minimum. Then 
(fo: --: : fn) is equivalent to (4 pene fy) but vup(f;/f) > 0 for all 7 so the functions 


f;/f are defined at P, and their values are not all 0 since f;/f = 1. So we get a morphism 
o: C — P", and in fact it maps into X, because the locus in C’ where the image satisfies 
the equations of X in P” is a subvariety of C containing infinitely many k-points. (Every 
subvariety of C’ other than C' itself is 0-dimensional, and hence a finite union of closed points, 


which contains only finitely many k-points.) 


Example 31.2. If C is not smooth, Theorem can fail: 


{fy =a(2+1)} 3A! 


y 
Ly 
(x,y) a 


gives a rational map between the projective closures that is not defined at the singularity 


(0,0). Over R, this map cannot even be extended to a continuous function. 
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Example 31.3. Similarly, 


{y? = 2°} > At 
(z,y) 4 2 
a 


gives a rational map between the projective closures that is not defined at the singularity 
(0, 0). 


An irreducible curve that is not necessarily smooth or projective is birational to a curve 
that is smooth and projective. (The analogue for higher-dimensional varieties is an unsolved 
problem in the case where char k > 0! This is called resolution of singularities.) 


Definition 31.4. A variety over k is nice if it is smooth, projective, and geometrically 
irreducible (i.e., irreducible even when considered over k). This is not universally accepted 


terminology, but it is convenient! 
Fact: A nonconstant morphism of curves @: C’ — C defines a field homomorphism 
K(C) > K(C") 
fre foo 


in the opposite direction, and this makes «(C’) a finite extension of «(C’). Define the degree 
of ¢ to be the degree of this extension, i.e., deg := [K(C") : K(C)]. 


Remark 31.5. If k is algebraically closed, then any nonconstant morphism C’ —+ C' induces 
a surjection C’(k) + C(k), and for all but finitely many P € C(k) the number of preimages 
of P in C’(k) equals the separable degree of «(C’) over «(C). In particular, this gives an 
alternative definition of deg ¢, at least if «(C’) is known to be separable over «(C) (for 
example, if char k = 0). 


Say that a field extension K of k is a 1-dimensional function field over k if K is a finite 
extension of a rational function field k(t) and K contains no nontrivial finite extension of k. 


Theorem 31.6. Then there is an equivalence of categories 


nonconstant morphisms field homomorphisms acting as the identity on k 


nice curves over k, 1-dimensional function fields over k, : 


C H KC). 
(The °? indicates that morphisms give rise to field homomorphisms in the opposite direction.) 


The proof is involved, so we will skip it. 
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Example 31.7. Let Co be the affine curve y? = x° — x over Q. Its projective closure is a 


nice curve C’. The map 
Co > Al 
(z,y)Oa 


extends to a morphism 
CoP! 


and the reverse map of function fields is the inclusion 


ee _ I_y 
k(x) — Frac Goan. Key) = hove i. 


which is a degree 2 extension of fields. 
32. DIVISORS 


Definition 32.1. A divisor is a formal sum 5> 
P, and np = 0 for almost all P. 


Aone Pee npP such that np € Z for all 


In other words, a divisor on C is a formal integer linear combination of (finitely many) 
closed points. 


Example 32.2. Let C be the projective curve x? + y? = z? over Q. Let P= (1:0: 1). Let 
Q =(8:4:5). Then 2P — 3Q is a divisor. 


Definition 32.3. The divisor group Div C’ is the group of all divisors on C' under addition. 


In other words, Div C is the free abelian group having as basis the set of closed points of 
i. 

There is a partial order on DivC: namely, ‘>npP > 5>mpP means that np > mp for 
all P. 


Definition 32.4. A divisor D = > npP is called effective if D > 0 (i.e., np > 0 for all P). 


A divisor D can be written as D,; — Dz where D; and Dz are effective divisors. Moreover, 
this representation is unique if we also insist that D, and D2 have “disjoint supports”. 


32.1. Degree of a divisor. Recall that if P is a closed point on C, then deg P is defined 
as the degree of k(P) := Op/mp as a field extension of k. 


Definition 32.5. If D = S>npP is a divisor on C, then the degree of D is defined as 
deg D := )) np(deg P). 


Example 32.6. Suppose that k is algebraically closed. Then each finite extension «(P) of 
k must equal k, so each closed point P has degree 1 (they are just the elements of C(k)). 


Thus if D = )>npP, then deg D = So np. 
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Example 32.7. In the 77+ y? = z? example above, the degree of the divisor 2P — 3Q is —1. 
The map 


Div’ += Z 
Dw deg D 


is a group homomorphism. 
Its kernel, the subgroup of divisors of degree 0, is denoted Div? C. 


32.2. Base extension. 


Definition 32.8. If X is a variety over a field k, and L is a field extension of k, then the 
base extension X,, is the variety defined by the same polynomial equations as X but with 
the polynomials viewed as polynomials with coefficients in Z (even though the coefficients 
are actually in the subfield k). 


A common case is where k is a perfect field and L = k is an algebraic closure of k. 


Example 32.9. If X = Spec pret, then Xp = Spec EH, Similarly, if Y = 


a2+y?—1)? x2+y2—1)° 
Proj wee. then Yg = Proj ess, 


If P is a closed point of C, then its base extension (to k) consists of a finite set of closed 
points P,,...,P,, of Cz, where n = deg P. Define a homomorphism 


DivC > Div CZ 


by mapping each closed point P of C to the corresponding sum P; +...+ P,, and extending 
linearly (i.e., extend so as to get a homomorphism). 


Example 32.10. Suppose that C is Pz. Then Cc is Pé. Closed points on C other than 
the point (1:0) “at infinity” are closed points in Az, which correspond to monic irreducible 


polynomials in R{t]. Each such polynomial has degree 1 or 2, and that degree is the degree 
of the closed point. The base extension of a closed point other than (1 : 0) is a set of 1 or 2 
points in C(C) corresponding to the zeros of the monic irreducible polynomial. 


Proposition 32.11. The homomorphism 
Div C' > Div CE 
is injective, and its image is the subgroup of G,-invariant elements of Div Cf. 


Sketch of proof. This follows from the description of a closed point of C' as a G,-orbit of 


elements of C(k). 
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Example 32.12. Let C be 2? + y? = 1 over Q. Let P = (1/2, V3/2) and Q = (1/2, V3/2); 
these are points in C(Q). Even though P and Q are not individually elements of C(Q), their 
sum P+ Q is a Gg-invariant divisor, so it comes from a closed point of C’. Namely, it comes 
from the closed point defined by the equations x? + y? = 1 and x = 1/2, that is, the closed 


This is a closed point of degree 2, with residue field Q(V/3). 


point Spec pes 


32.3. Principal divisors. Suppose that C' is a nice curve over k. Let f € K(C)* bea 
rational function on C’. Then the divisor of f is the divisor 
dvf=(f)= So w(f)P. 
closed points PEC’ 


Implicit in this definition is the proposition (which we assume without proof) that for any 
f € «(C)*, there are only finitely many P such that vp(f) = 0. 


Definition 32.13. A divisor is called principal if it equals (f) for some f € K(C)*. 


The map 
K(C)* > DivC 
fo (f) 


is a homomorphism, and its image is the set of principal divisors. This shows that the set 
of principal divisors is a subgroup of Div C. 


Example 32.14. If C = Pj, then «(C) is the rational function field k(t). Let P be a closed 
point of C, and let p(t) be the corresponding monic irreducible polynomial. If f € «(C)*, 
then vp(f) is measuring the exponent of p(t) in f. Thus the divisor of f is keeping track of 
the complete factorization of f. In other words it measures the zeros and poles of f with 
multiplicity, with poles giving a negative coefficient. 


Remark 32.15. For any rational function f € «(C’)*, if we write the principal divisor (f) as 
D, — Dz where D, and Dz are effective with disjoint supports, then the following positive 
integers are equal: 

e The degree of the rational map C — P! given by (f : 1); 

e deg D,, which is the number of zeros of f counted with multiplicity; and 

e deg Dz, which is the number of poles of f counted with multiplicity. 


Remark 32.16. Every principal divisor is of degree 0: that is, deg(div f) = 0 for every 
f Een(C)*. 


(We will not prove these last results, but you proved the last fact for C = P; on your last 


homework assignment. ) 
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32.4. Linear equivalence and the Picard group. 


Definition 32.17. Divisors D, and Dz are called linearly equivalent if there exists f € «(C)” 
such that D, — Dz = div(f). (Write D, ~ Dy in this case.) 


Linear equivalence is an equivalence relation. Each equivalence class |D] is called a divisor 
class. Because the set of principal divisors is a subgroup of Div C, the set of equivalence 
classes is the quotient group 

DivC 


Fie? = 
mc {principal divisors}’ 


which is called the Picard group of C’. Since Div C is abelian, so is its quotient PicC. 


Example 32.18. Let C = P;. Two divisors on C are linearly equivalent if and only if they 
have the same degree. In other words, PicC ~ Z. (You proved this in your last homework 


assignment. ) 


In general, for any nice curve C' over k, there is an exact sequence 


0 k* > &(C)* > DivC > PicC > 0. 


Remark 32.19. In more advanced algebraic geometry courses, one shows that divisor classes 
are in bijection with isomorphism classes of line bundles, which, loosely speaking, are families 
of vector spaces in which one has one vector space for each point of C. 


Because principal divisors are of degree 0, the degree homomorphism 
DivC 4 Z 
Dw deg D 
factors through the quotient Pic C: i.e., it induces a well-defined homomorphism 
Pic 4 Z 
[D] + deg D. 
Its kernel, consisting of divisor class of degree 0, is denoted Pic? C. 
Example 32.20. Let EF be the projective closure of the affine curve Ep in AS given by 
y? = «(a — 1)(x — 7). 
We will show that Pic FE contains an element of order 2. 
The projective closure is given by the equation 
y’z = x(x — z)(x — 7z) 


in P9. If we intersect with the “hyperplane at infinity” z = 0 in P?, we find that x = 0 too, 


so the point oo := (0: 1: 0) is the unique point on F not contained in the affine patch Eo. 
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What is the divisor of the rational function 
x € K(Eo)* = K(E)*? 


On Eo, the function x vanishes only at P := (0,0). But (2) must have total degree 0, 


so (x) = nP — noo for some positive integer n. To find n, we compute up(x). Since 
Z(y? — a(x —1)(a—7)) £0 at P, the function y is a uniformizer at P. Then 
1 2 
v= ; 
(e—D(«—7)” 


and the first factor is a unit at P, so vp(x) = 2. Thus 
(CS 2P Dee; 


Let D = P —oo. Then 2D is principal, so [D] € Pic E satisfies 2|D] = 0 in Pic E. 
How do we show that |D] itself is not 0 in Pic E? In other words, how do we show that D 
is not principal? 
One way: if D = (f), then deg f = 1 (the number of zeros of f), so [k(E) : «(P')] = 1, so 
E is birational to P', which means that E ~ P' (since E and P! are nice curves). But E(R) 


has two connected components, and P'(R) has only 1! 


Another way: For simplicity, we can base extend to C (if D = (f) for some f € K(E)*, 
then D viewed in Div Ec is principal too, the divisor of the same f). Redefine EF’ as the 
base extension Ec. The function field «(£) is C(x) (./x(x — 1)(x — 7)), which is a quadratic 
extension of C(x). The nontrivial automorphism o (of order 2) of «() fixing C(x) induces 


an automorphism 0: E — E whose restriction to Ep is the morphism (x,y) > (#,—y). 
This o induces an automorphism of Div FE, namely }>npP > Y\np(*P). In particular, if 
(f) =D, then 
"= DSP Awe aco Sf); 

which implies that °f = cf for some c € k*. Applying o shows that f = c’f. Thus 
f=ccf)=Cf,soc=+1. If c=1, then f is in the fixed field k(E)’, so f € C(x). If 
c = —1, then f/y is in the fixed field, so f = g(x)y for some rational function g € C(x)”. 
For each a € C, the divisor of x — a is 


(a, /a(a — 1)(a — 7)) + (a, —V/a(a — 1)(a — 7)) — 200 


(if a ¢ {0,1,7} then xz — a is a uniformizer at each of the first two points and has no other 


zeros or poles except at oo; if a € {0,1,7} use an argument as for x above). And taking the 
divisors of both sides of the equation 
y= ale —1)(a—7) 


shows that 


(y) = (0,0) + (1,0) + (7,0) — 300. 
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The rational function f in C(x)* or C(x)*y is a product of these times a nonzero constant, 
so in the principal divisor (f), the multiplicities of (0,0) and (1,0) have the same parity. In 
particular, D — oo cannot equal (f). 


33. GENUS 


(In this section you will be asked to take many things on faith, even more than usual.) 

If C is a nice curve over C, then C(C) can be viewed as a topological space, and it turns out 
to be a 1-dimensional compact complex manifold; i.e., a compact Riemann surface. By the 
classification of compact oriented surfaces, it is homeomorphic to a sphere with g handles, 
i.e., a g-holed torus, for some nonnegative integer g. This integer g is called the genus of C. 

In fact, there also exist algebraic definitions of the genus, in terms of “differentials”, and 
these apply to nice curves over any field, even fields of characteristic p. 

The genus of a nice curve is unchanged by base extension. 


33.1. Newton polygons of two-variable polynomials. 


Definition 33.1. A lattice point in the plane R? is an element of Z?. 


Definition 33.2. A convex lattice polygon P in R? is the convex hull of a finite subset of 
Z”. (Loosely speaking, you put a rubber band around the points.) We (re)define the length 
of a side of P as n — 1, where n is the number of lattice points on the side including the 
endpoints. 


Suppose that C is a nice curve birational to an affine plane curve f(x,y) = 0, where 
eo) = aay € k[z, yl. 
tJ 
Let P be a convex lattice polygon containing {(i,j) € Z? : a;; # 0}. For instance P could 
be the Newton polygon of f, defined as the convex hull of {(7,7) € Z? : ai; 4 O}. 

Given a side s of P, choose a direction along it, and label its lattice points 0, 1, ..., 4, 
where @ is the length of s; now form the homogeneous polynomial f,(t, u) of degree £ whose 
+1 coefficients are the coefficients of f corresponding to the lattice points on s in order 
(choose one of the two possible directions along s). We call f, a side polynomial (this is not 
standard terminology). 


Theorem 33.3. Let f = > ajjx'y’ and P be as above. Suppose that 


(i) The affine curve f(x,y) =0 is smooth. 
(ii) For each side s of P, the side polynomial f, is squarefree. 


Then the genus of C equals the number of lattice points in the interior of P. 


Remark 33.4. The zero polynomial is not squarefree. Thus the condition on the side poly- 


nomials will be satisfied usually only if P is close to being the Newton polygon on f. 
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34. RIEMANN-ROCH THEOREM 
Definition 34.1. Given D € Div C, define 
L(D) := {f € n(C)* : (f) + D > OFU {0}. 
Proposition 34.2. The set L(D) is a k-subspace of K(C). 


Proof. Suppose that D = S*npP. To say that f € L(D) is to say that vp(f) > —np for 
all P. Each condition vp(f) > —np defines a set of f that contains 0 and is closed under 
addition and multiplication by constants in k, 

so each condition defines a subspace Vp of K(C). Then L(D) = (\pVp, so L(D) is a 
subspace too. 


Example 34.3. If D = 0, then L(D) is the set of f € K(C) such that (f) > 0. But for 
nonzero f, the divisor (f) has degree 0, so (f) > 0 is possible only if (f) = 0, which holds 
when f € k*. Thus L(D) =k. 


Example 34.4. If D = 2P for a closed point P, then L(D) is the set of f € K(C) with at 
most a double pole at P (i.e., a double pole, simple pole, or defined at P), and defined at 
all other closed points of C. If D = 3P — 2Q, for closed points P and Q, then L(D) is the 
set of f € K(C) with at most a triple pole at P, and with at least a double zero at Q. 


If Dy < Dog, then L(Dj) © L(D3). 


Example 34.5. Let C = P! 5 A! = Speck[t]. Let oo € P'(k) be the point outside this At, 
SO Up(t) = —1, and more generally v..(p(t)) = —degp for any polynomial p(t) € kt]. Let 
D = 30. 

What is L(300)? If f = un € L(300), where p(t) and q(t) are nonzero relatively prime 
polynomials in k(t), then q(t) cannot have a zero at any closed point P of A!, because at 
any such zero we would get up(f) < 0, so (f) + 300 would not be effective. Thus q(t) is a 
constant, and we may assume q(t) = 1. Thus f = p(t) is a polynomial in t. The condition 
(f) + 800 > 0 implies v.(f) > —3, which says that — deg p(t) > —3, so deg p(t) < 3. Thus 
L (300) is the k-vector space of polynomials in k[t] of degree at most 3. In particular, L(300) 
has basis 1, t, t?, t?, so dim, L(300) = 4. 

Let P € A!(k) be the point where t takes the value 7. What is L(30o — P)? This is 
the subspace of L(300) consisting of polynomials that have at least a simple zero at P, or 
equivalently, that are divisible by t—7. Thus L(300—P) = {(t—7)g(t) : g(t) € k[t], deg g(t) < 
2}, which is a 3-dimensional k-vector space. 


It turns out that dim, L(D) is always finite. 


Definition 34.6. For each D € Div C, define (D) := dim, L(D) € Zso. 
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Example 34.7. If D =0, then L(D) =k, so ((D) =1. 
Proposition 34.8. [f deg D < 0, then L(D) = {0} and €(D) =0. 


Proof. Suppose that deg D < 0. If f € L(D) — {0}, then (f) + D > 0. The divisor (f) has 
degree 0, so (f) + D has the same negative degree as D. On the other hand, if (f) + D > 0, 
then (f) + D has nonnegative degree. This contradiction shows that no such f exists. 


Proposition 34.9. If D and D’ are linearly equivalent, then ((D) = ¢(D'). 


Proof. Write D = D’ + (g) for some g € K(C)*. If f € L(D) is nonzero, then (f) + D > 0, 
so (f)+D'+(g) > 0, so (fg) + D’ > 0, so fg € L(D’). Thus multiplication-by-g maps L(D) 
into L(D’), and does so injectively, since multiplication-by-g on «(C) is injective. Similarly, 
multiplication-by-g~' maps L(D’) into L(D). These maps define inverse isomorphisms of 
k-vector spaces between L(D) and L(D’). In particular, their dimensions ((D) and ¢(D’) 
are the same. 


Theorem 34.10 (Riemann-Roch). Let C be a nice curve of genus g over k. There exists a 
divisor class consisting of divisors K called canonical divisors such that 

€(D) — &k — D) =degD+1-g 
for all D € DivC. 


The Riemann-Roch theorem is rather deep, so we will not prove it here. From now on, 
denotes any fixed canonical divisor. 


Corollary 34.11. 
(i) (K) =g. 
(ii) deg K = 2g — 2. 
(iii) If deg D > 2g — 2, then €(D) =degD+1-g. 
Proof. 
(i) Taking D = 0 in the Riemann-Roch theorem yields 
1-2¢(kK) =04+1-g, 
so. LK) = g. 
(ii) Taking D = K yields 
g—-l=degK+1-g 
so deg K = 2g — 2. 
(iii) If deg D > 2g — 2, then deg(K — D) < 0 s0 &(K — D) =0 by Proposition [34.8] So the 


Riemann-Roch theorem simplifies to 


&(D) =degD+1-—-g. 
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Example 34.12. Let C = P! 5 A! = Speck[t]. For d > 0, we have 

L(doo) = {p(t) € it] : deg p(t) < d}, 
so ((doo) = d+ 1. On the other hand, when d is sufficiently large, then Corollary |34.11{c) 
implies that €(doo) = d+1-—g. Thus g = 0. (This agrees with the fact that P'(C) is 
topologically a sphere, which is of genus 0.) If D € DivP! is of degree d > —2, then 


Corollary |34.11(c) implies that @(D) = deg D + 1; alternatively, use that D ~ doo to obtain 
((D) = €(doo) = d+ 1. To summarize, for any D € Div P! of degree d, we have 


((D) = 0 if d < 0 (by Proposition |34.8) 
d+1 ifd>0. 


The same conclusion holds for any genus 0 curve C, by a similar argument. 


Proposition 34.13. If C is a nice curve of genus 0 over k, and C(k) is nonempty, then 
Cer. 


Proof. Choose P € C(k). By Corollary [34.11{c), OP) = 1 = 2, be 10) = 1 as im 
Example |34.7| so there exists f € L(P) — L(0). Since L(0) = k, this means that f is a 
nonconstant function with a simple pole at P and no other poles. The number of poles of 
f is 1, so the degree of the morphism C — P! given by (f : 1) equals 1. In other words, 


C — P' is a birational map, and hence an isomorphism. 


35. WEIERSTRASS EQUATIONS 
From now on, k is a perfect field of characteristic not 2 or 3. 
Definition 35.1. A (short) Weierstrass equation is a polynomial equation of the form 
y=x2+Ar+B 


for some constants A,B € k. (If chark were 2 or 3, we would instead consider long Weier- 
strass equations of the form 


2 3 2 
yo + a1 LY + A3Y = LX + Agx” + Aad + 46, 


but when chark 4 2,3, we can complete the square in y to make a, = 0 and ag = 0, and 
then complete the cube in x to make az = 0.) 


Proposition 35.2. Let E be the projective closure in P? of the affine curve Ep defined by a 
Weierstrass equation y? = 23+ Ax+B. Then the following are equivalent: 
(i) The affine curve Eo is smooth. 


(ii) The projective curve E is smooth. 
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(iii) The polynomial x? + Ax + B is separable (or equivalently, squarefree). 

(iv) The discriminant —16(4A? + 27B?) is nonzero. 

If these conditions hold, then E is a nice genus 1 curve with a single point P := (0: 1: 0) 
at infinity. Otherwise, E has a unique singular point, and E is birational to P!, which is of 


genus O. 


Proof. This will be assigned for homework. If FE is smooth, the fact that its genus is 1 follows 


from the genus formula (d — 1)(d — 2)/2 for a smooth plane curve of degree d. 


36. ELLIPTIC CURVES 


Definition 36.1. An elliptic curve over k is a nice genus-l curve over k equipped with a 
point in E(k) (called the origin). 


Theorem 36.2. 


(i) Given a Weierstrass equation y? = 234+ Arz+B with 23+Ax+B separable, the projective 
closure of this affine curve, equipped with the point (0: 1:0), is an elliptic curve over 
ke 

(ii) Every elliptic curve over k is isomorphic to one arising in this way. 


Proof. 
(i) This follows from Proposition [35.2| 
(ii) Let E be an elliptic curve, so F is of genus 1. Let P € E(k) be the origin of EF. By 
Corollary |34.11(c), we have 0(nP) =n for all n > 1. So we have bases as follows: 


1(0) = (1) = 6, 
L(P) = (1), 
LP y= (1 @) for some x € K(E), 
LP \= (Lay) for some y € K(E). 
In particular, vp(x) = —2 and vp(x) = —3. Then vp(x?) = —4, so x? € L(4P)—L(3P). 
Thus 
L(4P) = (1,2,y,2°). 
Similarly, 


L(5P) = al x, Y; x”, Ly). 
Now the 7 functions 1, 2, y, x”, ry, 2°, y” in the 6-dimensional vector space L(6P) must 
be linearly dependent, and the relation must involve both x? and y? since both of these 
have valuation —6 at P. By replacing x,y by Ax, Ay for suitable A € k*, we may 
assume that the relation takes the form of a long Weierstrass equation. By completing 


the square and cube, we may make it a short Weierstrass equation instead. Let C' be 
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the nice curve birational to the one given by this Weierstrass equation. Then «(C’) = 
k(x, y) C «(E). Since 


[A(E) : k(a)| = deg x = #{poles of x, counted with multiplicity} = 2 


and |[K(E) : k(y)| = 3 are relatively prime, k(x, y) = «(£). Thus E£ is birational to C. 
So C has genus 1, so x? + Ax + B is separable by Proposition [35.2| 


37. GROUP LAW 
Theorem 37.1. Let E be an elliptic curve with origin O. Then the map of sets 
E(k) 6 Pic? E 
PH [P-O 
is a bijection. 
Proof. Injectivity: Suppose that P,Q € E(k) are distinct points such that |[P—O] = |Q-—O}]. 
Then P—Q = (f) for some f € K(E)*. This f is of degree 1, so (f : 1) defines an isomorphism 
E — P', contradicting the assumption that E has genus 1. 
Surjectivity: Let [D] € Pic? E, where D € Div? E. Then £(D+0O) = 1, so L(D+0) # {0}, 


so there exists f € «(E£)* such that (f)+D+0O > 0. But deg((f)+D+0O)=04+0+1=1, 
so (f) + D+O = P for some P € E(k). Thus [D] = [P — O]. 


Since Pic? E is an abelian group, the bijection above makes E(k) into an abelian group. 


37.1. Chord-tangent description. Let E C P? be an elliptic curve in Weierstrass form. 
Let L C P? be a line. Then LM E can be computed by changing coordinates on P? to make 
L given by z = 0, and then substituting z = 0 into the degree 3 homogeneous polynomial 
defining E to get a degree 3 homogeneous polynomial in k[x, y], and looking at its zeros on 
P! ~ L. The result is three k-points, if we count them with appropriate multiplicities. More 
precisely, we may view LM E as a divisor of degree 3 on E. 


Example 37.2. If L is the line at infinity, given by z = 0, then LM E gives the divisor 3-O 
since substituting z = 0 into 

yz = 2° + Azz? + Bz 
yields x° = 0. 


Let ZL, and Ly, be two lines in P?, defined by linear forms ¢; and (2, respectively. View 
f := 6/2 as a rational function on E. Then one can show that 


where the intersections are viewed as degree 3 divisors on E as above. In particular, if 
Le is the line at infinity, and 2,9 E = (P) + (Q) + (R), where P,Q,R € E(k), then 
(f) =(P) + (Q) + (RK) - 3-0 =(P—-O)+(Q-0)+ (R-O). 
Proposition 37.3. Let E C P? be an elliptic curve in Weierstrass form, and let O = (0: 
1:0), as usual. Then 

(i) The point O is the identity for the group law on E(k). 

(ii) If P,Q,R © E(k) are such that there is a line L with LO E = (P)+(Q)+(R), then 

P+Q+R=O in the group E(k). 
Proof. (1) The point O € E(k) corresponds to [O — O] € Pic? E. 
(2) The sum P+Q-+R in E(k) corresponds to [P — O] + [Q — O] + [R— O], which as 
explained just before this proposition, is the class of a principal divisor. 


Proposition characterizes the group law on E(k) completely: 

e To compute the inverse of a point P = (a,b) € E(k) not equal to O, let L C P? be the 
projective closure of the vertical line x = a in A?; then LANE = (P)+(P’)+(O), where 
P’ := (a,—b). (LZ passes through O since its homogeneous equation is x — az = 0, 
which vanishes at (0: 1 : 0)); thus according to Proposition [37.3}fii), P+P'+0O0=0, 
so P’ = —P. Of course, Proposition [37.3]{i) we also know that —O = O. 

e To compute P+ Q where P,Q € E(k), first let L be the line in P? through P and Q; 
if P = Q, take L to be the tangent line to F at P. Then LN FE = (P)+(Q)+(R) 
for some R € E(k) (it is a k-point because its degree must be 1; more concretely, it 
is so because if two roots of a cubic polynomial are rational, then the third root is 
rational too). By Proposition [37.3}fiip , P+Q+R=0O,s0 P+Q = —R, which can 
be determined, as we already saw. 

In fact, it is possible to define a product variety E x E, an addition morphism E x FE > E, 
and an inverse morphism EF > E. 


37.2. Torsion points. 


Definition 37.4. Let E be an elliptic curve over k. Let P € E(L) for some field extension 
DLDk. Let n € Zsy. Call P an n-torsion point if nP = O in the group E(L). The n-torsion 


subgroup E[n] of E(k) is the kernel of the multiplication-by-n homomorphism 


[n]: E(k) > E(k) 
PwunP. 
Example 37.5. Assume chark 4 2. Let E be the projective closure of y? = f(a) where 


f(x) is a separable cubic polynomial. Then E|2] consists of O and the three points (a, 0) 
where a € k is a zero of f. Thus E[2] ~ Z/2Z x Z/2Z. 
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Assume moreover that f(x) = (x—e1)(x—e2)(x—es3) for distinct e1, e2,e3 € k. Then E[2] C 
E(k). Consider the multiplication-by-2 morphism on FE and the corresponding extension of 
function fields. 


E L:=«(E) 
| ] 
E K :=K(E) 


Each T € £[2] induces an addition-by-T morphism tr: E — E, a deck transformation of 
the top FE (i.e., an automorphism satisfying [2](7r(P)) = [2](P)), and this corresponds to 
an automorphism of L acting trivially on K. In fact, we get an injective homomorphism 
E|2] — Aut(L/K). 

In fact, it turns out that [2]: E > E is a morphism of degree 2? = 4 (that is, [L : K] = 4), 
so L/K is a Galois extension with Galois group E[2]. One way to prove this is to compute 
degrees of all the morphisms in the diagram 


[2] 
E — +E 


es 
Pt ee Pp! 
where x is the projection onto the x-coordinate, and ¢(x) is the rational function giving 


x((2|P) for P = (x,y): an explicit calculation of the tangent line for y? = 73+ Ax + B gives 


vt — 2Axr? — 8Bx + A? 
£4 O(a) = a((]P) = ASS 


which is a rational function of degree max(4,3) = 4. For a more conceptual proof that 
deg[2] = 4, using differentials and dual isogenies, see [Sil92]. 

Since Gal(L/K) ~ Z/2Z x Z/2Z, there are three intermediate quadratic fields, and it 
turns out that these are K(,/x — e;) for i = 1,2,3. Note that (a — e1)(a — e2)(x — e3) is 
already a square in K, namely y?. So L = K(,/z — €1, \/z — €2). 


38. MORDELL’S THEOREM 


In a 1901 paper, Poincaré considered the problem of finding generators for the group E(Q) 
for an elliptic curve FE over Q. It was only many years later, in 1922, that Mordell proved 
the existence of a finite set of generators. He used an argument resembling the “method 
of infinite descent” used by Fermat to prove that 2+ + y* = 2? has no solutions in positive 
integers. 


Theorem 38.1 (Mordell). [f E is an elliptic curve over Q, then the abelian group E(Q) is 


finitely generated. 
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By the structure theorem for finitely generated abelian groups, Mordell’s theorem implies 
that E(Q) ~ Z x T for some nonnegative integer r (called the rank of £) and some finite 
abelian group T (called the torsion subgroup of E). 


Remark 38.2. Mordell’s theorem is sometimes also called the Mordell-Weil theorem, but Weil’s 
contribution was to generalize it by replacing Q with an arbitrary finite extension of Q and 
E by an abelian variety of arbitrary dimension. 


All known proofs of Theorem are minor variants of the one we will give. It consists 
of two parts. The first part is the following: 


Theorem 38.3 (Weak Mordell-Weil theorem). If E is an elliptic curve over Q, then E(Q)/2E(Q) 
as finite. 


The second part involves the construction of a function h: E(Q) — R called a height func- 
tion. For P € E(Q), the value h(P) measures the size of the numerators and denominators 
of the coordinates of P. 


Remark 38.4. It is not known whether there exists an algorithm that takes EF’ as input and 
outputs a finite list of points that generate E(Q). The problem is that the proof of the weak 
Mordell-Weil theorem is not effective; i.e., it does not produce coset representative for the 
elements of E(Q)/2E(Q), even in principle. 


39. THE WEAK MORDELL-WEIL THEOREM 


In this section we will prove the weak Mordell-Weil theorem in the case that E[2] C E(Q), 
i.e., the case in which EF is given by an equation of the form 


If we make the substitution x = x'/d? and y = y'/d? and multiply both sides by d°, we get 
an isomorphic curve; moreover, by choosing d so the denominator of each e; divides d, the 
new curve is of the same form but with e; € Z. So assume that e; © Z from now on. 


Lemma 39.1. We have an isomorphism of abelian groups 
Q* 
Q*? 


a (or 


= Homeonts(Go, TEL?) 
"Va 
%) 


(Here, for each a € Q%, we write @ for its image in Q* /Q*?, and V/a for a fixed square root 


of a in Q*. The notation Homconts denotes the group of continuous homomorphisms. ) 
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Proof. First let us show that a > “ve is a homomorphism: If 0,7 € Gog, then 


“We "ve (“WY 

Ja va Va 

"Ya "Va 
Ja Va’ 

7 ya 


since the number ae +1 is fixed by a. It is a continuous homomorphism, since its kernel 


is the closed subgroup Gg ya) of Gg. Next, this homomorphism is independent of the choice 


of \/a, since changing the sign of \/a does not change the ratio Se Thus we have a well 


defined map of sets 
Q* 4 Homeonts(Gq, {£1}). 


If a,b € Q%, and we choose square roots \/a and Vb, and use Ja: Vb as a square root of ab, 
we get 

Jab _ *Jarvi 

Vvab a Vb" 


so O is a homomorphism. We have 


aéker(0) = °Va=Va for alla € Go 
= VaEeQ 


<=> a € Q*, 


Thus O induces a homomorphism 


Q* 
Q*?2 


—, Homconts(Go, {£1}). 


Given a nontrivial element @ € ae we get a well-defined quadratic extension L := Q(,/a). 
Given a quadratic extension L of Q, we get a nontrivial continuous homomorphism Gg — 
Gal(L/Q) ~ {+1}. The composition of these constructions is 0. Moreover, each construction 
can be reversed: Given a nontrivial continuous homomorphism Gg ~ {+1}, its kernel is a 
closed subgroup of index 2 in Gg, which by Galois theory is Gz for some field L of degree 2 
over Q. And given a quadratic extension L of Q, we may write L = Q(,/a) where a € Q” 
is uniquely determined modulo squares. This completes the proof that our homomorphism 


is an isomorphism. 


There is a partial analogue in which the multiplicative group of a field is replaced by an 
elliptic curve: 
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Lemma 39.2. For any elliptic curve E over Q such that E|2| C E(Q), there is an injective 


homomorphism 
E(Q) 


2E(Q) 
PH (oH ’Q-Q). 


aed Homeonts(Go, E[2}) 


(Here, for each P € E(Q), we write P for its image in E(Q)/2E(Q), and Q for a point in 
E(Q) such that 2Q = P.) 


Proof. By Remark [31.5] a choice of Q exists for each P. If o € Gg, then 2-7Q = 7(2Q) = 
°P = P, so 2°Q —Q) = P— P=0,s0°Q-Q € E[2]. The proof that oH "Q-Q 
is a homomorphism is the same as in Lemma 39.1} it is here that we use that E[2] is fixed 
pointwise by every o € Gg. The rest of the proof also copies the proof of Lemma [39.1] 


Remark 39.3. There is a generalization of Lemma that works even if E[2] is not con- 
tained in E(Q). It involves replacing Hom¢onts(Go, E[2]) by a continuous cohomology group, 
H' (Gg, E[2)). 


Proposition 39.4. For any elliptic curve E over Q such that E|2] C E(Q), there is an 
injective homomorphism 
x x 
EQ »# Q , @ 
2E(Q) QQ? Q 
If P = (2, y) = E(Q) _ {O, (en, 0); (e2,0)}, then 


@(P) = (x — e1, 2 — 9). 


Also, ¢(O) = (1,1) and 


b((e1, 0)) = ((e1 — €2)(€1 — €3), 1 — 2) 
b((€2,0)) = (e2 — e1, (€2 — €1) (€2 — €3)) . 


Remark 39.5. More canonically, we have an injective homomorphism 


Eq) (g3)" > 3) 


(x,y) > (a@ — €1, 4 — €2, © — €3) (for (x,y) € E(Q) — E[2)). 


For (z,y) € E[2] — {O}, two of the x — e; make sense, and the third can be assigned the 
value such that the product is 1. This explains the last formulas in Proposition [39.4] 


Sketch of proof. The fact that this defines a homomorphism can be checked with a brute 


force calculation. 
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But it really comes from Lemma [39.2] plus the isomorphism 
E{2] > {+1} x {+1} 
(e1,0) + (—1,1) 
(€2,0) + (1, -1), 
plus Lemma 39.1 It is saying that in order to take half of a point (x, y) € E(Q) — E[2], one 
must adjoin ,/x — e; and \/x — €2 to the ground field. 


Final exam on Mon Dec 14, 9am-12 in 3-135. It will be mainly based on topics covered in 
homework problems. Remaining office hours this week: Wed 1:30-2:30, Fri 12:30-1:30. 
Challenge problems: Show that every nice genus 2 curve over a field of characteristic not 


2 is birational to an affine curve y? = f(x) with f(x) separable of degree 5 or 6. 
What can you say about explicit equations of genus 3 curves? 
Compute £(Q)/2E(Q) for E: y? = x? — x. Can you determine E(Q) itself? 


Proposition 39.6. Let S be the set of primes p such that p|(e; — e;) for some distinct i, 7. 
Let Q(S, 2) be the finite subgroup of Q* /Q*? generated by (the images of) —1 and the primes 
in S. Then the image of the injective homomorphism 
E x x 
(Q) 6 Q@  Q 
2E(Q) QQ? Q” 
is contained in Q(S,2) x Q(S, 2). 


Sketch of proof. Suppose P = (x,y) € E(Q). For simplicity, let us assume that P ¢ EQ]. 
To say that x — e; € Q(S,2) is to say that v,(x# — e1) is even for every prime p ¢ S. Fix 
pE€S. 


Case 1: v,(z) <0. Then v,(x% — e;) = v,(x) for i = 1, 2,3. Now 
2up(y) = Up(y’) 
= vp((x — €1)(# — €2)(# — €3)) 
= Up(£ — €1) + Up(Z — €2) + Up(% — €3) 
= 3u,(2), 
so U,(a) is even. 
Case 2: v,(x) > 0. Then p divides at most one of x — e;, x — eg, x — €3, because otherwise 


subtracting would show that p divides some e; — e;, so p € S, a contradiction. On the other 
hand, vp((x — e1)(a — e2)(a — e3)) is even, as in Case 1, so v,(x — e;) must be even for each 


t. 


Proposition [39.6] proves that E(Q)/2E(Q) injects into a finite group; this proves the weak 


Mordell-Weil theorem (at least for elliptic curves EF over Q with E[2] C E(Q)). 
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40. HEIGHT OF A RATIONAL NUMBER 


Definition 40.1. Let ¢ = a/b be a rational number in lowest terms. The (exponential) height 
of t is 
H(t) = max((al,|6). 
Extend the definition to t € P!(Q) = QU {oo} by defining H(oo) = 1. 
Definition 40.2. The (logarithmic) height of t € QU {oo} is 
Alt) slop A(t). 
Example 40.3. We have h(100) < h(1001/1000). 


In general, h(t) is approximately the width of a piece of paper needed to write down t 
explicitly as a fraction of integers. 


Proposition 40.4 (Northcott). For any bound B € R, the set {t € Q: H(t) < B} is finite. 


Proof. Each t in this set has numerator in the range |—B, B| and denominator in [1, B], so 
there are at most (2B + 1)B possibilities. 


Challenge problem: Find an asymptotic formula for the size of this set as B — oo. 


Definition 40.5. The degree of a rational function p(x)/q(x) € Q(x) in lowest terms is 
max(deg p, deg q). 


Theorem 40.6. Jf f(x) is a rational function of degree d, then h( f(t) 


) = dh(t) + Of(1) for 
allt € Q. (That is, there is a constant C = C(f) such that |h(f(t)) — dh(t) 


| for allt € Q.) 


Proof. Write f(z) = p(x)/q(x), where p(x), q(x) € Za] have ged 1. 

Upper bound: Write t = a/b in lowest terms. Let P(x,y) = y?p(x/y) and Q(z,y) = 
y?q(x/y) be the homogenizations of p(x) and q(x), respectively. Then f(t) = f(a/b) = 
P(a,b)/Q(a,b). This might not be in lowest terms, but in any case 

H(f(t)) < max(|P(a, 6)|, |Q(a,0)|) < Cmax(|al, |b])" = CH(t)* 
for some constant C' depending on P and Q (i.e., on f). Taking log of both sides yields 
h( f(t) < dh(t) + log. 

Lower bound: We must bound |a| and |b] in terms of |P(a, b)| and |Q(a, b)|. Example: If 
P(a,b) = 3a? + b? and Q(a, b) = ab, we could use the identities 

aP(a,b) — bQ(a,b) = 3a® 
bP(a, b) — 3aQ(a, b) = b. 


In particular, 


gcd(P(a, b), Q(a, b))| ged(3a®, b*)|3 
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so P(a, b)/Q(a, b) is almost in lowest terms, so 


A(f(t)) = H(P(a,b)/Q(a, b)) ~ max(|P(a, b)|, |Q(a, b)]), 
where ~ means up to a bounded constant factor. Also, 
3|a|? < max(|al, |b]) max(|P(a, b)|, |Q(a, b)]) 
Jb)? < max(|a], |b]) max(|P(a, 6)|, |Q(@, 6))), 
sO 


max(|al, |b|)° < 


x(a], [b]) max(|P(a, 6)|, |Q(a, 6))) 
max(|a|, |b|)” < max(|P(a, 6)|, |Q(a,d)]) 
H(t)? < H(f(t)) times a constant. 


2h(t) < A(f(t)) + OC). 
To generalize to arbitrary P(a,b) and Q(a,b), we need the two identities. Observe that 


ma. 
ma. 


P(a,b) and Q(a,b) have no common zeros in Q except (0,0). So the Nullstellensatz implies 
that the ideals (P(a, b), Q(a, b)) and (a, b) of Qla, b] have the same radical. In particular, for 
some n, we have that a” and 6" lie in the ideal generated by P(a,b) and Q(a,b) in QJa, 8). 
Clearing denominators shows that there exists c € Zs; such that the same holds for ca” and 
cb” in Z{a, b]. 


41. HEIGHT FUNCTIONS ON ELLIPTIC CURVES 


Recall that we are studying the elliptic curve with equation 


y” = (w — e1)(x — e2)(x — €3). 


Without loss of generality, by making the substitution 7 +> x + c for some c € Q, we may 
assume that the coefficient of x? in the right hand side is 0. And then, as before, we may 
also assume that e; € Z for all i. Now the right hand side is also x? + Ax + B for some A 
and B. 


Definition 41.1. For P € E(Q), define 
he(P) = h(e(P)) = log H(#(P)). 
(By convention, h,(O) = 0.) 
Proposition 41.2. For all P € E(Q), we have 
h,(2P) = 4h,(P) + Oxz(1) 


where the bound on the error term depends only on E, not on P. 
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Proof. We claim that there is a rational function r(x) of degree 4 such that if P = (x,y), 
then «(2P) = r(x). This can be deduced by coordinate geometry, by using the chord-tangent 


law: one gets 
x* — 2Az* —8Br + A? 


oe 4(x3 + Ax + B) 


Alternatively, the diagram of curves 


[2] 
& — +E 


Pp! a Pp! 
induces a diagram of function fields 


Q(z, y) an Q(xe, yo) 


| | 


Q(x) ——  Q(#2), 
in which the coordinate functions x2 and yo on the FE on the right pull back to the functions 
x(2P) and y(2P) in Q(z, y). Since [2]: E — E is of degree 2? = 4, computing degrees of all 
field extensions in the diagram shows that 2degr = 4-2, so degr = 4. This completes the 
second proof of the claim. 

Now, taking the height of both sides of 7(2P) = r(z) yields 
h,(2P) = h(r(x)) = 4h(x) + Og(1), 
by Theorem [40.6] (The function r(x) depends on E, so the O(1) depends on E too.) 


Lemma 41.3. Given that E has equation y? = «3+ Ax +B with A,B € Z, Every rational 
point on E other than O has the form (+, =) for some a,b,d € Z@ with gcd(a,d) = 1 and 
eca(b,d) = 1; 
Proof. Any (x,y) € E(Q) — {O} satisfies 
yi =a? +Ar+B. 
Taking denominators of both sides shows that 
denom(y)? = denom()?, 


where denom(z) denotes the positive integer denominator when z is written in lowest terms. 
(Another way to see this: The equation implies that v,(y) < 0 if and only if v,(a) < 0, and 
in that case, 2vp(y) = 3u,(a).) This implies that there exists d > 1 such that 


denom(y) = d? and denom(a) = d?. 
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Lemma 41.4. If P,Q © E(Q) — {O} satisfy x(P) 4 x(Q), then 


2(P)+2(Q)+2(P+Q) = (45) 7 


Proof. Let y = ma + b be the line through P and Q, so 
_ Ga =) 
«(Q) — «(P) 
That line intersects FE in three points: P, Q, and R, say. Then R= —(P+Q). Also, x(P), 
x(Q), x(R) are the solutions to the cubic equation 


(ma +b)? = 2° + Ax + B, 
or equivalently, 
g°® — mz? + (A —2mb)z + (B — 6b?) =0, 


so x(P)+2(Q)+2(R) = m?. Substitute the value of m, and observe that «(R) = 2(P+Q) 
since R = —(P + Q). 


Proposition 41.5. Fix Po € E(Q). Then for every P € E(Q), 
h,(P + Po) < 2he(P) + Oz,p,(1). 


Proof. We may assume that Py #4 O. By increasing the constant, we can ignore any finite 
set of P, and hence assume that P is not O or +Po. Write 


a b 
as in Lemma Similarly, write 


ag b 
Po = (Xo, Yo) — (3. 3] 
0 “O 


Then 


2 
z(P+ Po) = (Z i) —X—2Xo. 


«L— Xo 
If we expand the square, replace y? by x? + Az + B, and replace y? by 73 + Aro + B, then 
we eventually get 
(t%q + A)(e + Xo) + 2B — 2yyo 
(x — 2)? 
_ (aa + Ad?d?) (ad? + aod”) + 2Bd*d5 = 2bdbodo 
(ad? — agd?)? , 


Examining each monomial in the numerator and denominator shows that 


«(P+ Po) = 


H(x(P + Po)) = Ox,r,(1) max {|a]°, Jad", |d|*, [bd] } 
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We have 
ja] < H(z) 
|d"| < H(z) 
and the equation y? = #3 + Ar + B so b? = a? + Aad* + Bd®, so 
Jb)? < Ox, (1) H(2)°. 
Plugging these estimates in yields 
H(a(P + Po)) = On,p(1) H(z)’. 
Taking log of both sides gives 
hy(P + Po) < 2h,(P) + Oz,p,(1). 


42. DESCENT 


It is traditional to define the naive height on the abelian group G := E(Q) by the formula 


h(P) = she(P). 


By Propositions]41.5}/41.2 and/40.4| respectively, h: G — R satisfies the following axioms: 
(i) For each Po € G, we have h(P + Py) < 2h(P) + Op,(1) for all P € G. 

(ii) We have h(2P) = 4h(P) + O(1) for all P. 

(iii) For each B € R, the set {P € G: h(P) < B} is finite. 


Proposition 42.1. If G is any abelian group such that G/2G' is finite, and h: G > R is 
any function satisfying (i) and (i), then there exists B > 0 such that G is generated by 
{PEG:h(P) < B}. So if h also satisfies (iii), then G is finitely generated. 


Proof. Let R be a set of coset representatives for G/2G. We will apply (i) only to Py € R, 
and R is finite, so all the O(1)’s are uniformly bounded. 
Given Qo € G, we may write Qo = 2Q, +r; for some Q; € G and r; € R; then 


4h(Q1) + O(1) = A(2Q1) < 2h(Qo) + O(1), 
So , ; 
A(Qi) < 5/t(Qo) qO(1).< 3/*(Qo): 


if h(Qo) is sufficiently large. Choose B so that this holds whenever h(Q)) > B. Let S := 
{P €G:h(P) < B}. We may increase B if necessary to assume that R C S. Let (S) be 
the subgroup of G generated by S. 
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We claim that (S) = G. Suppose that Qo € G. If h(Qo) > B, write Qo = 2Q1 +11 as 
above. If h(Q,) > B, repeat the process to write Q1 = 2Q2+ 12, and so on. Since the height 
is shrinking by a constant factor each time, eventually we reach a Q, with h(Q,) < B, ie., 
with Q, € S. (This is “Fermat’s method of infinite descent”!) Now Qn_1 = 2Qn+1n € (S), 
and Qn—-2 = 2Qn-1+Tn-1 € (S), and so on, until we show that Qo € (S). This holds for 
every Qo, so (S) =G. 


The weak Mordell-Weil theorem combined with the fact that h: E(Q) — R satisfies the 
hypotheses of Proposition [42.1] proves that E(Q) is finitely generated. 


43. FALTINGS’ THEOREM 


The following was conjectured by Mordell in 1922, proved by Faltings in 1983, and reproved 
by a different method by Vojta in 1991. 


Theorem 43.1. Let X be a nice curve of genus g > 1 over Q. Then X(Q) is finite. 


Both proof methods are very difficult. With a lot of work, each can be used to get an 
upper bound on #X(Q), but neither gives a method to determine X (Q) explicitly. 
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